NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO
HELICOBACTER PYLORI AND VACCINE COMPOSITIONS THEREOF 

Background of the Invention 
Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that
was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and
B. Marshall, (1983) Lancet I: 1273-1275; and Marshall et al., (1984) Microbios Lett. 25:
83-88). H. pylori has been strongly linked to chronic gastritis and duodenal ulcer
disease. (Rathbone et al., (1986) Gut 27: 635-641). Moreover, evidence is
accumulating for an etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer
disease, and gastric adenocarcinoma. (Blaser, M. J., (1993) Trends Microbiol. 1: 255-260).
Transmission of the bacteria occurs via the oral route, and the risk of infection
increases with age. (Taylor, D.N. and M. J. Blaser, (1991) Epidemiol. Rev. 13: 42-50).
H. pylori colonizes the human gastric mucosa, establishing an infection that usually
persists for decades. Infection by H. pylori is prevalent worldwide. Developed
countries have infection rates over 50% of the adult population, while developing
countries have infection rates reaching 90% of the adults over the age of 20. (Hopkins,
R. J. and J. G. Morris (1994) Am. J. Med. 97: 265-277).

The bacterial factors necessary for colonization of the gastric environment, and
for virulence of this pathogen, are poorly understood. Examples of the putative
virulence factors include the following: urease, an enzyme that may play a role in
neutralizing gastric acid pH (Eaton et al., (1991) Infect. Immunol. 59: 2470-2475;
Ferrero, R.L. and A. Lee (1991) Microb. Ecol. Hlth. Dis. 4: 121-134; Labigne et al.,
(1991) J. Bacteriol. 173: 1920-1931); the bacterial flagellar proteins responsible for
motility across the mucous layer (Hazell et al., (1986) J. Inf. Dis. 153: 658-663; Leying
et al., (1992) Mol. Microbiol. 6: 2863-2874; and Haas et al., (1993) Mol. Microbiol. 8:
753-760); Vac A, a bacterial toxin that induces the formation of intracellular vacuoles in
epithelial cells (Schmitt, W. and R. Haas, (1994) Molecular Microbiol. 12(2): 307-319);
and several gastric tissue-specific adhesins (Boren et al., (1993) Science 262: 1892-1895;
Evans et al., (1993) J. Bacteriol. 175: 674-683; and Falk et al., (1993) Proc. Natl.
Acad. Sci. USA 90: 2035-203).

Numerous therapeutic agents are currently available that eradicate H. pylori
infections in vitro. (Huesca et al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J.
G. Morris, supra). However, many of these treatments are suboptimally effective in vivo
because of bacterial resistance, altered drug distribution, patient non-compliance or poor
drug availability. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics
combined with bismuth is part of the standard regimen used to treat H. pylori infection.
(Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics 15 Supp.
B: 37-48). Recently, combinations of a proton pump inhibitor and a single antibiotic
have been shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E.
Dominguez-Munoz, supra). However, methods employing antibiotic agents can have the
problem of the emergence of bacterial strains which are resistant to these agents.
(Hopkins, R. J. and J. G. Morris, supra). These limitations demonstrate that new, more
effective methods are needed to combat H. pylori infections in vivo. In particular, the
design of new vaccines that may prevent infection by this bacterium is highly desirable.

Summary of the Invention 

This invention relates to novel genes, e.g., genes encoding polypeptides such as
bacterial surface proteins, from the organism Helicobacter pylori (H. pylori), and other
related genes, their products, and uses thereof. The nucleic acids and peptides of the
present invention have utility for diagnostics and therapeutics for H. pylori and other
Helicobacter species. They can also be used to detect the presence of H. pylori and
other Helicobacter species in a sample, and for screening compounds for the
ability to interfere with the H. pylori life cycle or to inhibit H. pylori infection. More
specifically, this invention features compositions of nucleic acids corresponding to
entire coding sequences of H. pylori proteins, including surface or secreted proteins or
parts thereof, nucleic acids capable of binding mRNA from H. pylori proteins to block
protein translation, and methods for producing H. pylori proteins or parts thereof using
peptide synthesis and recombinant DNA techniques. This invention also features
antibodies and nucleic acids useful as probes to detect H. pylori infection. In addition,
vaccine compositions and methods for the protection or treatment of infection by H.
pylori are within the scope of this invention.

Detailed Description of the Drawings

Figure 1 depicts an amino acid sequence alignment of five H. pylori proteins
(depicted in the single letter amino acid code and designated by their amino acid
Sequence ID Numbers; shown N-terminal to C-terminal, left to right).

Figure 2 depicts the N-terminal portion of three H. pylori proteins (depicted in
the single letter amino acid code and designated by their amino acid Sequence ID
Numbers; shown N-terminal to C-terminal, left to right).

Detailed Description of the Invention

In one aspect, the invention features a recombinant or substantially pure
preparation of an H. pylori polypeptide of SEQ ID NO: 98. The invention also includes
a substantially pure nucleic acid encoding an H. pylori polypeptide of SEQ ID NO: 98;
such nucleic acid is contained in SEQ ID NO: 1. The H. pylori polypeptide sequences
of the invention described herein are contained in the Sequence Listing, and the nucleic
acids encoding H. pylori polypeptides of the invention are contained in the Sequence
Listing.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 99,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 2. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 100, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 3.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 101,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 4. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 102, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 5.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 103, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 6. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 104,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 7. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 105,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 8.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 106,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 9. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 107,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 10.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 108,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 11.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 109,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 12. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 110, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 13.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 111,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 14. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 112,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 15.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 113,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 16. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 114,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 17. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 115,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 18.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 116,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 19. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 117, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 20.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 118,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 21. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 119,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 22.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 120, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 23. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 121,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 24. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 122, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 25.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 123,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 26. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 124, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 27.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 125,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 28. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 126,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 29. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 127,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 30.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 128,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 31.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 129, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 32.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 130,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 33. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 131,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 34.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 132,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 35. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 133,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 36. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 134,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 37.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 135,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 38. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 136, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 39.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 137, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 40. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 138,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 41.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 139,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 42.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 140,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 43.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 141, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 44.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 142,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 45. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 143,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 46.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 144, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 47. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 145,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 48. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 146,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 49.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 147,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 50. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 148, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 51.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 149, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 52. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 150,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 53. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 151, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 54.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 152,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 55. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 153, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 56.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 154, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 57. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 155,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 58.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 156,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 59. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 157,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 60. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 158, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 61.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 159,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 62. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 160, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 63.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 161, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 64. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 162,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 65. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 163,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 66.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 164,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 67. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 165, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 68.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 166, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 69. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 167,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 70.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 168,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 71.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 169,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 72. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 170, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 73.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 171,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 74. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 172, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 75.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 173,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 76. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 174,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 77. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 175,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 78.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 176,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 79. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 177, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 80.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 178,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 81. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 179,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 82. 



WO 98/24475 



PCT/US97/22104 



In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 180,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 83. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 181,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 84. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 182,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 85.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 183,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 86. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 184, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 87.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 185, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 88. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 186,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 89. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 187, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 90.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 188,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 91. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 189, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 92.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 190,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 93.

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 191,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 94.

In another aspect, the invention features a substantially pure nucleic acid
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 192,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 95. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 193,
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 96. 

In another aspect, the invention features a substantially pure nucleic acid 
encoding an H. pylori polypeptide having an amino acid sequence of SEQ ID NO: 194, 
such as a nucleic acid comprising a nucleotide sequence of SEQ ID NO: 97.

In another aspect, the invention features an isolated nucleic acid having a
nucleotide sequence encoding an H. pylori polypeptide at least about 60% homologous
to an amino acid sequence selected from the group consisting of SEQ ID NO: 98-SEQ
ID NO: 194. In a preferred embodiment, the isolated nucleic acid includes a nucleotide
sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a
complement thereof.

In another aspect, the invention features an isolated nucleic acid having a 
nucleotide sequence encoding an H. pylori polypeptide selected from the group 
consisting of SEQ ID NO: 98-SEQ ID NO: 194. 
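
The aspects recited above pair each nucleotide sequence with an amino acid
sequence by a fixed offset: SEQ ID NO: 1 is paired with SEQ ID NO: 98, SEQ ID NO: 2
with SEQ ID NO: 99, and so on through SEQ ID NO: 97 and SEQ ID NO: 194. As an
illustrative sketch only (the helper names are hypothetical and are not part of the
Sequence Listing), the pairing can be written in Python as follows:

```python
# Illustrative sketch only; the helper names below are hypothetical and do not
# appear in the Sequence Listing.

FIRST_NUCLEIC_ACID_ID = 1    # SEQ ID NO: 1
LAST_NUCLEIC_ACID_ID = 97    # SEQ ID NO: 97
ID_OFFSET = 97               # SEQ ID NO: 1 pairs with SEQ ID NO: 98, and so on


def polypeptide_id_for(nucleic_acid_id: int) -> int:
    """Return the amino acid SEQ ID NO paired with a nucleotide SEQ ID NO."""
    if not FIRST_NUCLEIC_ACID_ID <= nucleic_acid_id <= LAST_NUCLEIC_ACID_ID:
        raise ValueError(f"SEQ ID NO: {nucleic_acid_id} is not a recited nucleic acid")
    return nucleic_acid_id + ID_OFFSET


def nucleic_acid_id_for(polypeptide_id: int) -> int:
    """Return the nucleotide SEQ ID NO paired with an amino acid SEQ ID NO."""
    nucleic_acid_id = polypeptide_id - ID_OFFSET
    if not FIRST_NUCLEIC_ACID_ID <= nucleic_acid_id <= LAST_NUCLEIC_ACID_ID:
        raise ValueError(f"SEQ ID NO: {polypeptide_id} is not a recited polypeptide")
    return nucleic_acid_id


if __name__ == "__main__":
    print(polypeptide_id_for(63))    # amino acid sequence paired with SEQ ID NO: 63
    print(nucleic_acid_id_for(194))  # nucleotide sequence paired with SEQ ID NO: 194
```

Identifiers outside the recited ranges raise an error rather than returning a
number that does not correspond to any sequence named in this description.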

In another aspect, the invention features an isolated nucleic acid which encodes 
an H. pylori polypeptide, having a nucleotide sequence at least about 60% homologous
to a nucleotide sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID 
NO: 97, or a complement thereof. 
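
This passage does not state how the "at least about 60% homologous" threshold is
computed. One common convention, assumed here purely for illustration, is the
percentage of compared positions at which two equal-length, pre-aligned sequences
carry the same residue. The function names, the gap character "-", and the example
sequences below are hypothetical and are not taken from the Sequence Listing.

```python
# Illustrative sketch under stated assumptions: inputs are already aligned to
# equal length, "-" marks a gap, and homology is taken to mean percent identity.

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of compared alignment columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # a column that is a gap in both sequences carries no information
        compared += 1
        if a == b:
            matches += 1  # a gap opposite a residue never matches
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 60.0) -> bool:
    """True when percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical 10-nucleotide sequences differing at the last three positions.
    print(percent_identity("ATGCATGCAT", "ATGCATGGGG"))
    print(meets_threshold("ATGCATGCAT", "ATGCATGGGG"))
```

Other definitions of homology (for example, scoring conservative amino acid
substitutions as similar) would give different values for the same pair of
sequences, so the threshold should be read against whichever definition governs.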

In another aspect, the invention features an isolated nucleic acid molecule 
encoding an H. pylori polypeptide, having a nucleotide sequence which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule having the nucleotide
sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a 
complement thereof. 

In another aspect, the invention features an isolated nucleic acid having a 
nucleotide sequence of at least 8 nucleotides in length, wherein the sequence hybridizes 
under stringent hybridization conditions to a nucleic acid having a nucleotide sequence
selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a complement
thereof. 

Particularly preferred is an isolated nucleic acid having a nucleotide sequence 
encoding an H. pylori cell envelope polypeptide or a fragment thereof, the nucleic acid 
selected from the group consisting of SEQ ID NO: 63, SEQ ID NO: 7, SEQ ID NO: 8,
SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24,
SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 61,
SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91,
SEQ ID NO: 94, SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36,
SEQ ID NO: 42, SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 30,
SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17,
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 38,
SEQ ID NO: 39, SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 34, SEQ 
ID NO: 35, SEQ ID NO: 60, SEQ ID NO: 69, and SEQ ID NO: 83, or a complement 
thereof. 

In one embodiment, the H. pylori cell envelope polypeptide or a fragment thereof
is an H. pylori flagella-associated polypeptide or a fragment thereof encoded by a
nucleic acid having a nucleotide sequence of SEQ ID NO: 63, or a complement thereof.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment
thereof is an H. pylori inner membrane polypeptide or a fragment thereof encoded by a
nucleic acid selected from the group consisting of SEQ ID NO: 48, SEQ ID NO: 49,
SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 43, SEQ ID NO: 44,
SEQ ID NO: 38, and SEQ ID NO: 39, or a complement thereof.

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in transport encoded 
by a nucleic acid selected from the group consisting of SEQ ID NO: 48, SEQ ID NO:
49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 43, and SEQ ID
NO: 44, or a complement thereof. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment
thereof is an H. pylori outer membrane polypeptide or a fragment thereof encoded by a
nucleic acid selected from the group consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ
ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID
NO: 27, SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 61, SEQ ID
NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, SEQ ID
NO: 94, SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID
NO: 42, SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID
NO: 65, and SEQ ID NO: 66, or a complement thereof.

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a 
fragment thereof encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID
NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID
NO: 28, SEQ ID NO: 36, SEQ ID NO: 42, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID
NO: 52, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID
NO: 85, SEQ ID NO: 91, and SEQ ID NO: 94, or a complement thereof.

In another embodiment, the H. pylori outer membrane polypeptide or a fragment
thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a
C-terminal tyrosine cluster or a fragment thereof encoded by a nucleic acid selected from
the group consisting of SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID NO:
42, and SEQ ID NO: 52, or a complement thereof. 

Particularly preferred is an isolated nucleic acid having a nucleotide sequence 
encoding an H. pylori cell envelope polypeptide or a fragment thereof selected from the
group consisting of SEQ ID NO: 160, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO:
106, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID
NO: 124, SEQ ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 158, SEQ
ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188,
SEQ ID NO: 191, SEQ ID NO: 102, SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO:
133, SEQ ID NO: 139, SEQ ID NO: 149, SEQ ID NO: 119, SEQ ID NO: 126, SEQ ID
NO: 127, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 145, SEQ ID NO: 146, SEQ
ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ ID NO: 141,
SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO:
103, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 157, SEQ ID NO: 166, and SEQ
ID NO: 180.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori flagella-associated polypeptide or a fragment thereof having an 
amino acid sequence of SEQ ID NO: 160. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori inner membrane polypeptide or a fragment thereof selected from
the group consisting of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 114, SEQ ID
NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 135, and 
SEQ ID NO: 136. 

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in transport selected
from the group consisting of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 114,
SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, and SEQ ID NO: 141. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof selected from 
the group consisting of SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID
NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 124, SEQ 
ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 158, SEQ ID NO: 176,
SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188, SEQ ID NO:
191, SEQ ID NO: 102, SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID 
NO: 139, SEQ ID NO: 149, SEQ ID NO: 119, SEQ ID NO: 126, SEQ ID NO: 127, SEQ 
ID NO: 162, and SEQ ID NO: 163.

In another embodiment, the H. pylori outer membrane polypeptide or a fragment
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a
fragment thereof selected from the group consisting of SEQ ID NO: 104, SEQ ID NO:
105, SEQ ID NO: 106, SEQ ID NO: 108, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID
NO: 120, SEQ ID NO: 121, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ
ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149,
SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 
182, SEQ ID NO: 188, and SEQ ID NO: 191. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a
C-terminal tyrosine cluster or a fragment thereof selected from the group consisting of
SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID NO: 139, and SEQ ID 
NO: 149. 

Particularly preferred is an isolated nucleic acid having a nucleotide sequence 
encoding an H. pylori cytoplasmic polypeptide or a fragment thereof, wherein the
nucleic acid is selected from the group consisting of SEQ ID NO: 57, SEQ ID NO: 58,
SEQ ID NO: 86, SEQ ID NO: 87, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 92, 
and SEQ ID NO: 93, or a complement thereof. 

In one embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof
is an H. pylori polypeptide or a fragment thereof involved in mRNA translation, wherein
the nucleic acid is selected from the group consisting of SEQ ID NO: 57 and SEQ ID
NO: 58, or a complement thereof. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in genome replication, 
transcription, recombination and repair, wherein the nucleic acid is selected from the
group consisting of SEQ ID NO: 86 and SEQ ID NO: 87, or a complement thereof.

Particularly preferred is an isolated nucleic acid having a nucleotide sequence
encoding an H. pylori cytoplasmic polypeptide or a fragment thereof selected from the 
group consisting of SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 183, SEQ ID NO: 
184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 189, and SEQ ID NO: 190. 

In one embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof
is an H. pylori polypeptide or a fragment thereof involved in mRNA translation selected
from the group consisting of SEQ ID NO: 154 and SEQ ID NO: 155.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof involved in genome replication, 
transcription, recombination and repair selected from the group consisting of SEQ ID 
NO: 183 and SEQ ID NO: 184.

Particularly preferred is an isolated nucleic acid having a nucleotide sequence
encoding an H. pylori secreted polypeptide or a fragment thereof, the nucleic acid
selected from the group consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 10,
SEQ ID NO: 12, SEQ ID NO: 20, SEQ ID NO: 25, SEQ ID NO: 31, SEQ ID NO: 32,
SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID NO: 53, SEQ ID NO: 64, SEQ ID NO: 67,
SEQ ID NO: 70, SEQ ID NO: 77, SEQ ID NO: 78, SEQ ID NO: 81, SEQ ID NO: 82,
SEQ ID NO: 90, SEQ ID NO: 95, and SEQ ID NO: 97, or a complement thereof. 

Particularly preferred is an isolated nucleic acid having a nucleotide sequence 
encoding an H. pylori secreted polypeptide or a fragment thereof selected from the group 
consisting of SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 107, SEQ ID NO: 109,
SEQ ID NO: 117, SEQ ID NO: 122, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO:
142, SEQ ID NO: 143, SEQ ID NO: 150, SEQ ID NO: 161, SEQ ID NO: 164, SEQ ID
NO: 167, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 178, SEQ ID NO: 179, SEQ 
ID NO: 187, SEQ ID NO: 192, and SEQ ID NO: 194. 

Particularly preferred is an isolated nucleic acid having a nucleotide sequence
encoding an H. pylori cellular polypeptide or a fragment thereof, the nucleic acid
selected from the group consisting of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 21,
SEQ ID NO: 33, SEQ ID NO: 37, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID NO: 47,
SEQ ID NO: 54, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 59, SEQ ID NO: 62,
SEQ ID NO: 68, SEQ ID NO: 71, SEQ ID NO: 72, SEQ ID NO: 73, SEQ ID NO: 74,
SEQ ID NO: 75, SEQ ID NO: 76, and SEQ ID NO: 96, or a complement thereof.

Particularly preferred is an isolated nucleic acid having a nucleotide sequence 
encoding an H. pylori cellular polypeptide or a fragment thereof selected from the group 
consisting of SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 118, SEQ ID NO: 130, 
SEQ ID NO: 134, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 144, SEQ ID NO:
151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 156, SEQ ID NO: 159, SEQ ID
NO: 165, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ
ID NO: 172, SEQ ID NO: 173, and SEQ ID NO: 193. 

In another aspect, the invention features a probe having a nucleotide sequence 
consisting of at least 8 nucleotides of a nucleotide sequence selected from the group
consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a complement thereof.

In another aspect, the invention features an isolated H. pylori polypeptide having
an amino acid sequence at least about 60% homologous to an H. pylori polypeptide 
selected from the group consisting of SEQ ID NO: 98-SEQ ID NO: 194. 

In another aspect, the invention features an isolated H. pylori polypeptide which
is encoded by a nucleic acid having a nucleotide sequence at least about 60%
homologous to a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1-SEQ ID NO: 97. In one embodiment, the isolated H. pylori polypeptide is encoded by 
a nucleotide sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 
97. 

In another aspect, the invention features an isolated H. pylori polypeptide which
is encoded by a nucleic acid which hybridizes under stringent hybridization conditions 
to a nucleic acid selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, 
or a complement thereof. 

In another aspect, the invention features an isolated H. pylori polypeptide having
an amino acid sequence selected from the group consisting of SEQ ID NO: 97-SEQ ID
NO: 194. 

Particularly preferred is an isolated H. pylori cell envelope polypeptide or a 
fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 160, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 110,
SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 124, SEQ ID NO:
125, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID
NO: 177, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188, SEQ ID NO: 191, SEQ
ID NO: 102, SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID NO: 139,
SEQ ID NO: 149, SEQ ID NO: 119, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO:
162, SEQ ID NO: 163, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 114, SEQ ID
NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 135, SEQ 
ID NO: 136, SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 103, SEQ ID NO: 131, 
SEQ ID NO: 132, SEQ ID NO: 157, SEQ ID NO: 166, and SEQ ID NO: 180. 

In one embodiment, the H. pylori cell envelope polypeptide or a fragment thereof
is an H. pylori flagella-associated polypeptide or a fragment thereof having an amino
acid sequence of SEQ ID NO: 160. 

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori inner membrane polypeptide or a fragment thereof selected from
the group consisting of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 114, SEQ ID
NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 135, and
SEQ ID NO: 136.

In another embodiment, the H. pylori inner membrane polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof involved in transport selected 
from the group consisting of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 114,
SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO:
135, and SEQ ID NO: 136.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof selected from 
the group consisting of SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID 
NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 124, SEQ
ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 158, SEQ ID NO: 176,
SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188, SEQ ID NO:
191, SEQ ID NO: 102, SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID
NO: 139, SEQ ID NO: 149, SEQ ID NO: 119, SEQ ID NO: 126, SEQ ID NO: 127, SEQ
ID NO: 162, and SEQ ID NO: 163. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a
fragment thereof selected from the group consisting of SEQ ID NO: 104, SEQ ID NO:
105, SEQ ID NO: 106, SEQ ID NO: 108, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID
NO: 120, SEQ ID NO: 121, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ
ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149,
SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO:
182, SEQ ID NO: 188, and SEQ ID NO: 191.

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a
C-terminal tyrosine cluster or a fragment thereof selected from the group consisting of
SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID NO: 139, and SEQ ID 
NO: 149. 

Particularly preferred is an isolated H. pylori cell envelope polypeptide or a 
fragment thereof, wherein the polypeptide is encoded by a nucleic acid selected from the
group consisting of SEQ ID NO: 63, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9,
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 27,
SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 61, SEQ ID NO: 79,
SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, SEQ ID NO: 94,
SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID NO: 42,
SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 65,
SEQ ID NO: 66, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID NO: 18,
SEQ ID NO: 19, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 38, SEQ ID NO: 39,
SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 34, SEQ ID NO: 35, SEQ
ID NO: 60, SEQ ID NO: 69, and SEQ ID NO: 83.

In one embodiment, the H. pylori cell envelope polypeptide or a fragment thereof 
is an H. pylori flagella-associated polypeptide or a fragment thereof encoded by a
nucleic acid having a nucleotide sequence of SEQ ID NO: 63.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori inner membrane polypeptide or a fragment thereof encoded by a 
nucleic acid selected from the group consisting of SEQ ID NO: 48, SEQ ID NO: 49, 
SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 43, SEQ ID NO: 44, 
SEQ ID NO: 38, and SEQ ID NO: 39.

In another embodiment, the H. pylori inner membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide or a fragment thereof involved in transport encoded 
by a nucleic acid selected from the group consisting of SEQ ID NO: 48, SEQ ID NO: 
49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 43, and SEQ ID 
NO: 44.

In another embodiment, the H. pylori cell envelope polypeptide or a fragment 
thereof is an H. pylori outer membrane polypeptide or a fragment thereof encoded by a 
nucleic acid selected from the group consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ 
ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID
NO: 27, SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 61, SEQ ID
NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, SEQ ID
NO: 94, SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID
NO: 42, SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID 
NO: 65, and SEQ ID NO: 66. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment
thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a
fragment thereof encoded by a nucleic acid selected from the group consisting of SEQ
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID
NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID
NO: 28, SEQ ID NO: 36, SEQ ID NO: 42, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID
NO: 52, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID
NO: 85, SEQ ID NO: 91, and SEQ ID NO: 94. 

In another embodiment, the H. pylori outer membrane polypeptide or a fragment 
thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a
C-terminal tyrosine cluster or a fragment thereof encoded by a nucleic acid selected from
the group consisting of SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID NO:
42, and SEQ ID NO: 52.

Particularly preferred is an isolated H. pylori cytoplasmic polypeptide or a
fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ 
ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, 
SEQ ID NO: 186, SEQ ID NO: 189, and SEQ ID NO: 190.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof involved in mRNA translation 
selected from the group consisting of SEQ ID NO: 154 and SEQ ID NO: 155. 

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof involved in genome
replication, transcription, recombination and repair selected from the group consisting of
SEQ ID NO: 183 and SEQ ID NO: 184. 

Particularly preferred is an isolated H. pylori cytoplasmic polypeptide or a 
fragment thereof, wherein the polypeptide is encoded by a nucleic acid selected from the 
group consisting of SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 86, SEQ ID NO: 87,
SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 92, and SEQ ID NO: 93.

In one embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof
is an H. pylori polypeptide or a fragment thereof involved in mRNA translation, wherein
the polypeptide is encoded by a nucleic acid selected from the group consisting of SEQ
ID NO: 57 and SEQ ID NO: 58.

In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment
thereof is an H. pylori polypeptide or a fragment thereof involved in genome replication,
transcription, recombination and repair, wherein the polypeptide is encoded by a nucleic 
acid selected from the group consisting of SEQ ID NO: 86 and SEQ ID NO: 87. 

Particularly preferred is an isolated H. pylori cellular polypeptide or a fragment
thereof, wherein the polypeptide is selected from the group consisting of SEQ ID NO:
112, SEQ ID NO: 113, SEQ ID NO: 118, SEQ ID NO: 130, SEQ ID NO: 134, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 144, SEQ ID NO: 151, SEQ ID NO: 152, SEQ 
ID NO: 153, SEQ ID NO: 156, SEQ ID NO: 159, SEQ ID NO: 165, SEQ ID NO: 168, 
SEQ ID NO: 169, SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO:
173, and SEQ ID NO: 193.

Particularly preferred is an isolated H. pylori cellular polypeptide or a fragment
thereof, wherein the polypeptide is encoded by a nucleic acid selected from the group 
consisting of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 21, SEQ ID NO: 33, SEQ 
ID NO: 37, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID NO: 47, SEQ ID NO: 54, SEQ
ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 59, SEQ ID NO: 62, SEQ ID NO: 68, SEQ ID
NO: 71, SEQ ID NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID
NO: 76, and SEQ ID NO: 96.

Particularly preferred is an isolated H. pylori secreted polypeptide or a fragment
thereof, wherein the polypeptide is selected from the group consisting of SEQ ID NO: 
100, SEQ ID NO: 101, SEQ ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 117, SEQ ID 
NO: 122, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 142, SEQ ID NO: 143, SEQ 
ID NO: 150, SEQ ID NO: 161, SEQ ID NO: 164, SEQ ID NO: 167, SEQ ID NO: 174,
SEQ ID NO: 175, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 187, SEQ ID NO: 
192, and SEQ ID NO: 194. 

Particularly preferred is an isolated H. pylori secreted polypeptide or a fragment 
thereof, wherein the polypeptide is encoded by a nucleic acid selected from the group
consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID
NO: 20, SEQ ID NO: 25, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 45, SEQ ID
NO: 46, SEQ ID NO: 53, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 70, SEQ ID
NO: 77, SEQ ID NO: 78, SEQ ID NO: 81, SEQ ID NO: 82, SEQ ID NO: 90, SEQ ID 
NO: 95, and SEQ ID NO: 97. 

In another aspect, the invention features a chimeric H. pylori polypeptide
comprising at least two H. pylori polypeptides or fragments thereof, wherein the
polypeptides are encoded by nucleic acid sequences selected from the group consisting
of SEQ ID NO: 1-SEQ ID NO: 97.

In another aspect, the invention features a chimeric H. pylori polypeptide
comprising at least two H. pylori polypeptides or fragments thereof, wherein the
polypeptides are selected from the group consisting of SEQ ID NO: 98-SEQ ID NO: 194.

In another aspect, the invention features a fusion protein comprising an H. pylori 
polypeptide which comprises an amino acid sequence selected from the group consisting 
of SEQ ID NO: 98-SEQ ID NO: 194 operatively linked to a non-H. pylori polypeptide.

In another aspect, the invention features a vaccine formulation for prophylactic
or therapeutic treatment of an H. pylori infection comprising an effective amount of at 
least one isolated nucleic acid of the invention. 

In another aspect, the invention features a vaccine formulation for prophylactic 
or therapeutic treatment of an H. pylori infection comprising an effective amount of at
least one H. pylori polypeptide of the invention.

Preferably, the vaccine formulation of the invention further includes a 
pharmaceutically acceptable carrier. In one embodiment, the pharmaceutically 
acceptable carrier includes an adjuvant. In another embodiment, the pharmaceutically 
acceptable carrier includes a delivery system, e.g., a live vector, e.g., a bacterium or a
virus. In another embodiment, the pharmaceutically acceptable carrier includes both an
adjuvant and a delivery system.

In another aspect, the invention features a method of treating or reducing a risk
of H. pylori infection in a subject. The method includes administering to a subject a 
vaccine formulation of the invention, such that treatment or reduction of risk of H. pylori 
infection occurs. 

In another aspect, the invention features a method of producing a vaccine
formulation of the invention. The method includes combining at least one isolated H.
pylori polypeptide or a fragment thereof selected from the group consisting of SEQ ID
NO: 98-SEQ ID NO: 194 with a pharmaceutically acceptable carrier to thereby form a
vaccine formulation. 

In another aspect, the invention features a method of producing a vaccine
formulation of the invention. The method includes culturing a cell under conditions that
permit expression of an H. pylori polypeptide or a fragment thereof selected from the
group consisting of SEQ ID NO: 98-SEQ ID NO: 194; isolating the H. pylori
polypeptide from the cell; and combining at least one isolated H. pylori polypeptide or a
fragment thereof with a pharmaceutically acceptable carrier to thereby form a vaccine
formulation. 

In another aspect, the invention pertains to any individual H. pylori polypeptide 
member or nucleic acid encoding such a member from the above-identified groups of H. 
pylori polypeptides. 

In another aspect, the invention features nucleic acids capable of binding mRNA
of H. pylori. Such a nucleic acid is capable of acting as an antisense nucleic acid to control
the translation of mRNA of H. pylori. A further aspect features a nucleic acid which is 
capable of binding specifically to an H. pylori nucleic acid. These nucleic acids are also 
referred to herein as complements and have utility as probes and as capture reagents. 

In another aspect, the invention features an expression system comprising an
open reading frame corresponding to H. pylori nucleic acid. The nucleic acid further
comprises a control sequence compatible with an intended host. The expression system
is useful for making polypeptides corresponding to H. pylori nucleic acid.

In another aspect, the invention features a cell transformed with the expression
system to produce H. pylori polypeptides.

In another aspect, the invention features a method of generating antibodies 
against H. pylori polypeptides which are capable of binding specifically to H. pylori 
polypeptides. Such antibodies have utility as reagents for immunoassays to evaluate the 
abundance and distribution of H. pylori-specific antigens.

In another aspect, the invention features a method of generating vaccines for
immunizing an individual against H. pylori. The vaccination method includes:
immunizing a subject with at least one H. pylori polypeptide according to the present
invention, e.g., a surface or secreted polypeptide, or active portion thereof, and a
pharmaceutically acceptable carrier. Such vaccines have therapeutic and/or prophylactic 
utilities. 

In another aspect, the invention provides a method for generating a vaccine 
comprising a modified immunogenic H. pylori polypeptide, e.g., a surface or secreted
polypeptide, or active portion thereof, and a pharmacologically acceptable carrier. 

In another aspect, the invention features a method of evaluating a compound, e.g.,
a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H.
pylori polypeptide. The method includes: contacting the candidate compound with an
H. pylori polypeptide and determining if the compound binds or otherwise interacts with
an H. pylori polypeptide. Compounds which bind H. pylori are candidates as activators 
or inhibitors of the bacterial life cycle. These assays can be performed in vitro or in 
vivo. 

In another aspect, the invention features a method of evaluating a compound, e.g.,
a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H.
pylori nucleic acid, e.g., DNA or RNA. The method includes: contacting the candidate
compound with an H. pylori nucleic acid and determining if the compound binds or
otherwise interacts with an H. pylori nucleic acid. Compounds which bind H. pylori are
candidates as activators or inhibitors of the bacterial life cycle. These assays can be
performed in vitro or in vivo.

The invention features H. pylori polypeptides, preferably a substantially pure 
preparation of an H. pylori polypeptide, or a recombinant H. pylori polypeptide. In
preferred embodiments: the polypeptide has biological activity; the polypeptide has an 
amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical or
homologous to an amino acid sequence of the invention contained in the Sequence
Listing, preferably it has about 65% sequence identity with an amino acid sequence of
the invention contained in the Sequence Listing, and most preferably it has about 92% to 
about 99% sequence identity with an amino acid sequence of the invention contained in 
the Sequence Listing; the polypeptide has an amino acid sequence essentially the same
as an amino acid sequence of the invention contained in the Sequence Listing; the
polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acid residues in length; the
polypeptide includes at least 5, preferably at least 10, more preferably at least 20, more 
preferably at least 50, 100, or 150 contiguous amino acid residues of the invention 
contained in the Sequence Listing. In yet another preferred embodiment, the amino acid
sequence which differs in sequence identity by about 7% to about 8% from the H. pylori
amino acid sequences of the invention contained in the Sequence Listing is also
encompassed by the invention.

In preferred embodiments: the H. pylori polypeptide is encoded by a nucleic
acid of the invention contained in the Sequence Listing, or by a nucleic acid having at 
least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid of the 
invention contained in the Sequence Listing.

In a preferred embodiment, the subject H. pylori polypeptide differs in amino
acid sequence at 1, 2, 3, 5, 10 or more residues from a sequence of the invention
contained in the Sequence Listing. The differences, however, are such that the H. pylori
polypeptide exhibits an H. pylori biological activity, e.g., the H. pylori polypeptide 
retains a biological activity of a naturally occurring H. pylori polypeptide. 

In preferred embodiments, the polypeptide includes all or a fragment of an amino
acid sequence of the invention contained in the Sequence Listing, fused, in reading
frame, to additional amino acid residues, preferably to residues encoded by genomic
DNA 5' or 3' to the genomic DNA which encodes a sequence of the invention contained
in the Sequence Listing. 

In yet other preferred embodiments, the H. pylori polypeptide is a recombinant
fusion protein having a first H. pylori polypeptide portion and a second polypeptide
portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to
H. pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase,
a DNA binding domain, or a polymerase activating domain. In a preferred embodiment,
the fusion protein can be used in a two-hybrid assay.

Polypeptides of the invention include those which arise as a result of alternative
transcription events, alternative RNA splicing events, and alternative translational and
post-translational events.

The invention also encompasses an immunogenic component which includes at
least one H. pylori polypeptide in an immunogenic preparation; the immunogenic
component being capable of eliciting an immune response specific for the H. pylori
polypeptide, e.g., a humoral response, an antibody response, or a cellular response. In
preferred embodiments, the immunogenic component comprises at least one antigenic
determinant from a polypeptide of the invention contained in the Sequence Listing.

In another aspect, the invention provides a substantially pure nucleic acid having
a nucleotide sequence which encodes an H. pylori polypeptide. In preferred
embodiments: the encoded polypeptide has biological activity; the encoded polypeptide
has an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99%
homologous to an amino acid sequence of the invention contained in the Sequence
Listing; the encoded polypeptide has an amino acid sequence essentially the same as an
amino acid sequence of the invention contained in the Sequence Listing; the encoded
polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded
polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more
preferably at least 50, 100, or 150 contiguous amino acids of the invention contained in
the Sequence Listing.
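
The "contiguous amino acids" criterion recited above can be illustrated with a short Python sketch. The function name and the example sequences are hypothetical and are not taken from the Sequence Listing; the sketch only shows one way such a check might be performed.

```python
def shares_contiguous_residues(candidate: str, reference: str, n: int) -> bool:
    """Return True if `candidate` contains at least `n` contiguous residues
    that also occur contiguously in `reference`.

    Both arguments are one-letter amino acid strings. The sequences used
    with this function here are illustrative only.
    """
    if n <= 0:
        raise ValueError("n must be a positive integer")
    candidate = candidate.upper()
    reference = reference.upper()
    # Slide a window of length n along the candidate and look it up in the reference.
    return any(
        candidate[i:i + n] in reference
        for i in range(len(candidate) - n + 1)
    )


# Hypothetical example sequences (not from the Sequence Listing).
reference = "MKKTLLAVSLAFGMGSAHA"
candidate = "XXTLLAVSXX"
print(shares_contiguous_residues(candidate, reference, 5))
```

The same check applies to any of the recited thresholds (5, 10, 20, 50, 100, or 150) by changing `n`.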

In preferred embodiments: the nucleic acid of the invention is that contained in
the Sequence Listing; the nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or
99% homologous with a nucleic acid sequence of the invention contained in the
Sequence Listing.

In a preferred embodiment, the encoded H. pylori polypeptide differs (e.g., by
amino acid substitution, addition or deletion of at least one amino acid residue) in amino
acid sequence at 1, 2, 3, 5, 10 or more residues, from a sequence of the invention
contained in the Sequence Listing. The differences, however, are such that: the H.
pylori encoded polypeptide exhibits an H. pylori biological activity, e.g., the encoded H.
pylori enzyme retains a biological activity of a naturally occurring H. pylori polypeptide.

In preferred embodiments, the encoded polypeptide includes all or a fragment of
an amino acid sequence of the invention contained in the Sequence Listing; fused, in
reading frame, to additional amino acid residues, preferably to residues encoded by
genomic DNA 5' or 3' to the genomic DNA which encodes a sequence of the invention
contained in the Sequence Listing.

In preferred embodiments, the subject H. pylori nucleic acid will include a
transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or
transcriptional enhancer sequence, operably linked to the H. pylori gene sequence, e.g.,
to render the H. pylori gene sequence suitable for expression in a recombinant host cell.

In yet a further preferred embodiment, the nucleic acid which encodes an H.
pylori polypeptide of the invention hybridizes under stringent conditions to a nucleic
acid probe corresponding to at least 8 consecutive nucleotides of the invention contained
in the Sequence Listing; more preferably to at least 12 consecutive nucleotides of the
invention contained in the Sequence Listing; more preferably to at least 20 consecutive
nucleotides of the invention contained in the Sequence Listing; more preferably to at
least 40 consecutive nucleotides of the invention contained in the Sequence Listing.

In a preferred embodiment, the nucleic acid encodes a peptide which differs by at
least one amino acid residue from the sequences of the invention contained in the
Sequence Listing.

In a preferred embodiment, the nucleic acid differs by at least one nucleotide
from a nucleotide sequence of the invention contained in the Sequence Listing which
encodes amino acids of the invention contained in the Sequence Listing.

In another aspect, the invention encompasses: a vector including a nucleic acid
which encodes an H. pylori polypeptide or an H. pylori polypeptide variant as described
herein; a host cell transfected with the vector; and a method of producing a recombinant
H. pylori polypeptide or H. pylori polypeptide variant, including culturing the cell, e.g.,
in a cell culture medium, and isolating the H. pylori polypeptide or H. pylori polypeptide
variant, e.g., from the cell or from the cell culture medium.

In another aspect, the invention features a purified recombinant nucleic acid
having at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a
sequence of the invention contained in the Sequence Listing.

The invention also provides a probe or primer which includes a substantially
purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence
which hybridizes under stringent conditions to at least 8 consecutive nucleotides of
sense or antisense sequence of the invention contained in the Sequence Listing, or
naturally occurring mutants thereof. In preferred embodiments, the probe or primer
further includes a label group attached thereto. The label group can be, e.g., a
radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor.
Preferably the oligonucleotide is at least 8 and less than 10, 20, 30, 50, 100, or 150
nucleotides in length.

The invention also provides an isolated H. pylori polypeptide which is encoded 
by a nucleic acid which hybridizes under stringent hybridization conditions to a nucleic 
acid contained in the Sequence Listing. 

The invention further provides nucleic acids, e.g., RNA or DNA, encoding a
polypeptide of the invention. This includes double stranded nucleic acids as well as
coding and antisense single strands.

The H. pylori strain, from which genomic sequences have been sequenced, has
been deposited in the American Type Culture Collection (ATCC # 55679; deposited by
Genome Therapeutics Corporation, 100 Beaver Street, Waltham, MA 02154) as strain
HP-J99.

Included in the invention are: allelic variations; natural mutants; induced
mutants; proteins encoded by DNA that hybridizes under high or low stringency
conditions to a nucleic acid which encodes a polypeptide of the invention contained in
the Sequence Listing (for definitions of high and low stringency see Current Protocols in
Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1-6.3.6 and 6.4.1-6.4.10,
hereby incorporated by reference); and, polypeptides specifically bound by antisera to H.
pylori polypeptides, especially by antisera to an active site or binding domain of H.
pylori polypeptide. The invention also includes fragments, preferably biologically
active fragments. These and other polypeptides are also referred to herein as H. pylori
polypeptide analogs or variants.

Putative functions have been determined for several of the H. pylori polypeptides
of the invention, as shown in Table 1.

Accordingly, uses of the claimed H. pylori polypeptides based on these identified
functions, as well as other functions as described herein, are also within the scope of the
invention.

In addition, the present invention encompasses H. pylori polypeptides
characterized as shown in Table 1 below, including: H. pylori cell envelope proteins, H.
pylori secreted proteins, H. pylori cytoplasmic proteins and H. pylori cellular proteins.
Members of these groups were identified by BLAST homology searches and by searches
for secretion signal or transmembrane protein motifs. Polypeptides related by
significant homology to the polypeptides of Table 1 are also considered to be classified
in the manner of the homologs shown in Table 1.



TABLE 1

                                      ntSeqID (PCT)   aaSeqID (PCT)

A. CELL ENVELOPE

A.1 Flagella-associated
hp1p13939_24322162_f3_17              63              160

A.2 Outer membrane

A.2.1 Terminal phe residue
02ge10116_23462_f2_43                 7               104
02ge10116_804550_f2_44                8               105
02ge41622_14875000_c2_65              9               106
01cp20708_214843_c2_49                13              110
01cp20708_4960952_c1_43               [missing]       111
06ae11016_4729625_c3_68               23              120
06ep10615_49068_c2_87                 24              121
06gp71906_35158328_f3_85              27              124
06gp71906_3941642_f2_70               28              125
13ae10610_156411_c3_33                50              147
13ae10610_6522827_c3_37               51              148
hp4e53394_11798952_c2_101             61              158
06ge20501_4298568_c3_53               79              176
11ae12004_3367666_c2_41               80              177
hp7e10433_5345837_p3_13               84              181
14ce61516_2460[illegible]_f2_9        85              182
11ap20714_2077_c3_103                 91              188
02cp10615_21908138_f1_4               94              191

A.2.2 No terminal phe residue
07gp11909_26460892_f2_6               5               102

A.2.3 Phe and Tyr cluster at C-terminus
02ge41622_34176513_c1_50              11              108
06gp71906_20486556_f2_65              26              123
hp7e10520_14728137_f1_1               36              133
02ae31010_417818_f3_29                42              139
13ae10610_26855313_f3_15              52              149

A.2.4 Via homology
hp5p15212_13729635_c3_35              22              119
07ee11402_1046877_c3_100              29              126
14ee41924_1046877_c3_104              30              127
hp1p13939_21641016_f1_1               65              162
hp4p62853_4766691_f3_23               66              163

A.3 Inner membrane
[The entries under this heading are illegible in the source; the only
legible fragments of the Seq. ID columns are 114 and 141.]

A.3.2 Other inner membrane proteins
02ae31010_16679640_f2_21              38              135
07ee50709_16679640_f3_60              39              136

A.4 Other cell envelope proteins
01ce61016_1056562_c3_123              1               98
09cd61003_16619192_c2_83              2               [missing]
02ge10116_15632000_c2_114             6               103
04ae61517_12345837_f2_4               34              131
04ae61517_21744091_f3_5               35              132
hp4e13394_26750068_c3_113             60              157
hp5p15575_1053590_c1_35               69              166
hp7e10433_5345837_c2_8                83              180

B. CYTOPLASMIC PROTEINS

B.1 Proteins involved in mRNA translation
[The entries under this heading are illegible in the source.]

B.2 Proteins involved in genome replication, transcription,
recombination and repair
14ce61516_12600937_f2_11              86              183
14cp11908_25402267_c3_104             87              184

B.3 Other cytoplasmic proteins
05ce10910_23712780_c1_4               88              185
hp7e10192_23712780_f2_5               89              186
11ap20714_34663910_f3_29              92              189
hp8e10065_4962812_f2_18               93              190

C. SECRETED PROTEINS
01ce61016_23593955_c3_140             3               100
09cp61003_23593955_c1_79              4               101
02ge41622_20730462_f1_19              10              107
01cp20708_10628177_c2_50              12              109
05ae30220_24415693_c3_175             20              117
06gp10409_4015687_[illegible]2_11     25              122
hp2e10911_10213593_c1_73              31              128
hp2e10911_35567005_c2_88              32              129
09ze10333_1457137_f3_11               45              142
06cp30603_10744075_c3_136             46              143
12ae10622_30273255_f1_13              53              150
05ce10208_4707035_c2_17               64              161
06ep30223_176437_c2_134               67              164
hp5p15575_26016387_f2_16              70              167
hp6p12244_4881375_c3_97               77              174
06ce20610_34647187_c2_33              78              175
hp7e10433_36339535_f3_3               81              178
hp7e10433_36339535_f3_3               82              179
hp7e10420_24391078_f1_3               90              187
02ce71018_35720091_c3_4               95              192
hp6s10363_30517031_f3_3               97              194

D. OTHER CELLULAR PROTEINS
01ae11010_26437877_c2_52              15              112
hp4p33322_5891077_c2_45               16              113
hp3p21118_54628_c3_3                  21              118
02ae31010_1064125_f1_11               33              130
hp2e10911_15680337_c3_105             37              134
hp2e10911_24804577_c3_104             40              137
hp2e10911_32234750_c1_68              41              138
06cp30603_26070252_c3_140             47              144
03ae10804_235286_f3_19                54              151
09ge11604_4804692_c1_8                55              152
hp2p10610_21987687_c2_5               56              153
hp4e13394_26182793_f2_45              59              156
hp4e53394_2082126_c2_102              62              159
06ep30223_25402187_c1_112             68              165
hp6e10491_12712706_f3_12              71              168
hp6p12129_12542880_c3_29              72              169
hp6p12129_17067265_c3_29              73              170
hp6p12129_214055_f1_2                 74              171
hp6p12129_214055_f3_17                75              172
hp6p12244_33492712_c3_88              76              173
hp1e13054_22360653_f2_4               96              193

[In Table 1, "nt" represents nucleotide Seq. ID number and "aa" represents amino acid
Seq. ID number]

Definitions

The terms "purified polypeptide" and "isolated polypeptide" and "a substantially
pure preparation of a polypeptide" are used interchangeably herein and, as used herein,
mean a polypeptide that has been substantially, and preferably completely, separated
from other proteins, lipids, and nucleic acids with which it naturally occurs. Preferably,
the polypeptide is also separated from substances, e.g., antibodies or gel matrix, e.g.,
polyacrylamide, which are used to purify it. Preferably, the polypeptide constitutes at
least 10, 20, 50, 70, 80 or 95% dry weight of the purified preparation. Preferably, the
preparation contains: sufficient polypeptide to allow protein sequencing; at least 1, 10,
or 100 µg of the polypeptide; at least 1, 10, or 100 mg of the polypeptide. Furthermore,
the terms "purified polypeptide" and "isolated polypeptide" and "a substantially pure
preparation of a polypeptide," as used herein, refer to both a polypeptide obtained from
nature or produced by recombinant DNA techniques as described herein.

For example, an "isolated" or "purified" protein or biologically active portion
thereof is substantially free of cellular material or other contaminating proteins from the
cell or tissue source from which the H. pylori protein is derived, or substantially free
from chemical precursors or other chemicals when chemically synthesized. The
language "substantially free of cellular material" includes preparations of H. pylori
protein in which the protein is separated from cellular components of the cells from
which it is isolated or recombinantly produced. In one embodiment, the language
"substantially free of cellular material" includes preparations of H. pylori protein having
less than about 30% (by dry weight) of non-H. pylori protein (also referred to herein as a
"contaminating protein"), more preferably less than about 20% of non-H. pylori protein,
still more preferably less than about 10% of non-H. pylori protein, and most preferably
less than about 5% non-H. pylori protein. When the H. pylori protein or biologically
active portion thereof is recombinantly produced, it is also preferably substantially free
of culture medium, i.e., culture medium represents less than about 20%, more preferably
less than about 10%, and most preferably less than about 5% of the volume of the
protein preparation.

The language "substantially free of chemical precursors or other chemicals"
includes preparations of H. pylori protein in which the protein is separated from
chemical precursors or other chemicals which are involved in the synthesis of the protein.
In one embodiment, the language "substantially free of chemical precursors or other
chemicals" includes preparations of H. pylori protein having less than about 30% (by dry
weight) of chemical precursors or non-H. pylori chemicals, more preferably less than
about 20% chemical precursors or non-H. pylori chemicals, still more preferably less
than about 10% chemical precursors or non-H. pylori chemicals, and most preferably
less than about 5% chemical precursors or non-H. pylori chemicals.

A purified preparation of cells refers to, in the case of plant or animal cells, an in
vitro preparation of cells and not an entire intact plant or animal. In the case of cultured
cells or microbial cells, it consists of a preparation of at least 10% and more preferably
50% of the subject cells.

A purified or isolated or a substantially pure nucleic acid, e.g., a substantially
pure DNA (these terms are used interchangeably herein), is a nucleic acid which is one
or both of the following: not immediately contiguous with both of the coding sequences
with which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in
the naturally-occurring genome of the organism from which the nucleic acid is derived; or
which is substantially free of a nucleic acid with which it occurs in the organism from
which the nucleic acid is derived. The term includes, for example, a recombinant DNA
which is incorporated into a vector, e.g., into an autonomously replicating plasmid or
virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a
separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or
restriction endonuclease treatment) independent of other DNA sequences. Substantially
pure DNA also includes a recombinant DNA which is part of a hybrid gene encoding
additional H. pylori DNA sequence.

A "contig" as used herein is a nucleic acid representing a continuous stretch of
genomic sequence of an organism.

An "open reading frame", also referred to herein as ORF, is a region of nucleic
acid which encodes a polypeptide. This region may represent a portion of a coding
sequence or a total sequence and can be determined from a stop to stop codon or from a
start to stop codon.
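
By way of illustration only, the "start to stop codon" determination described above can be sketched in Python as follows. The sketch assumes ATG as the only start codon and the three standard stop codons, scans the three forward frames of a single strand, and uses a function name (`find_orfs`) and a test sequence that are hypothetical and not part of the disclosure.

```python
# Assumptions for this sketch: standard stop codons, ATG as the sole start codon.
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODON = "ATG"


def find_orfs(dna: str, min_codons: int = 1):
    """Return (start, end, frame) tuples for start-to-stop ORFs.

    `start` is the 0-based index of the start codon, `end` is the index just
    past the stop codon, and `frame` is 0, 1, or 2. Only the given strand is
    scanned; `min_codons` counts the codons before the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == START_CODON:
                start = i  # open a candidate ORF at the first start codon
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # close the ORF and keep scanning this frame
    return orfs


# Hypothetical sequence: one ORF (ATG AAA TAG) in frame 2.
print(find_orfs("CCATGAAATAGGG"))
```

To cover the opposite strand, the same function could be applied to the reverse complement of the sequence.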

As used herein, a "coding sequence" is a nucleic acid which is transcribed into
messenger RNA and/or translated into a polypeptide when placed under the control of
appropriate regulatory sequences. The boundaries of the coding sequence are
determined by a translation start codon at the five prime terminus and a translation stop
codon at the three prime terminus. A coding sequence can include but is not limited to
messenger RNA, synthetic DNA, and recombinant nucleic acid sequences.

A "complement" of a nucleic acid as used herein refers to an anti-parallel or
antisense sequence that participates in Watson-Crick base-pairing with the original
sequence.
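
As a minimal sketch of this definition for DNA, assuming only the four standard bases, the anti-parallel complement can be computed as shown below. The function name `reverse_complement` is illustrative.

```python
# Watson-Crick pairing for DNA: A<->T, C<->G (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the anti-parallel (reverse) complement of a DNA sequence."""
    # Complement each base, then reverse so the result reads 5' to 3'.
    return dna.translate(_COMPLEMENT)[::-1]


print(reverse_complement("ATTGCC"))  # GGCAAT
```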

A "gene product" is a protein or structural RNA which is specifically encoded by
a gene.

As used herein, the term "probe" refers to a nucleic acid, peptide or other
chemical entity which specifically binds to a molecule of interest. Probes are often
associated with or capable of associating with a label. A label is a chemical moiety
capable of detection. Typical labels comprise dyes, radioisotopes, luminescent and
chemiluminescent moieties, fluorophores, enzymes, precipitating agents, amplification
sequences, and the like. Similarly, a nucleic acid, peptide or other chemical entity which
specifically binds to a molecule of interest and immobilizes such molecule is referred
to herein as a "capture ligand". Capture ligands are typically associated with or capable of
associating with a support such as nitro-cellulose, glass, nylon membranes, beads,
particles and the like. The specificity of hybridization is dependent on conditions such
as the base pair composition of the nucleotides, and the temperature and salt
concentration of the reaction. These conditions are readily discernible to one of
ordinary skill in the art using routine experimentation.

Homologous refers to the sequence similarity or sequence identity between two
polypeptides or between two nucleic acid molecules. When a position in both of the two
compared sequences is occupied by the same base or amino acid monomer subunit, e.g.,
if a position in each of two DNA molecules is occupied by adenine, then the molecules
are homologous at that position. The percent of homology between two sequences is a
function of the number of matching or homologous positions shared by the two
sequences divided by the number of positions compared x 100. For example, if 6 of 10
of the positions in two sequences are matched or homologous then the two sequences are
60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC
share 50% homology. Generally, a comparison is made when two sequences are aligned
to give maximum homology.
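
The percent homology calculation defined above can be sketched in Python as follows. The sketch assumes the two sequences have already been aligned to the same length and simply counts matching positions; the function name `percent_homology` is illustrative, and no alignment step is performed.

```python
def percent_homology(seq_a: str, seq_b: str) -> float:
    """Percent of compared positions occupied by the same monomer.

    Assumes the sequences are already aligned and of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where both sequences carry the same base or residue.
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# The example from the text: 3 of 6 positions match.
print(percent_homology("ATTGCC", "TATGGC"))  # 50.0
```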

Nucleic acids are hybridizable to each other when at least one strand of a nucleic
acid can anneal to the other nucleic acid under defined stringency conditions.
Stringency of hybridization is determined by: (a) the temperature at which hybridization
and/or washing is performed; and (b) the ionic strength and polarity of the hybridization
and washing solutions. Hybridization requires that the two nucleic acids contain
complementary sequences; depending on the stringency of hybridization, however,
mismatches may be tolerated. Typically, hybridization of two sequences at high
stringency (such as, for example, in a solution of 0.5X SSC, at 65°C) requires that the
sequences be essentially completely homologous. Conditions of intermediate stringency
(such as, for example, 2X SSC at 65°C) and low stringency (such as, for example, 2X
SSC at 55°C), require correspondingly less overall complementarity between the
hybridizing sequences. (1X SSC is 0.15 M NaCl, 0.015 M Na citrate). A preferred,
non-limiting example of stringent hybridization conditions is hybridization in 6X
sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in
0.2X SSC, 0.1% SDS at 50-65°C.
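
The salt concentrations implied by the SSC multiples recited above follow directly from the stated 1X composition (0.15 M NaCl, 0.015 M Na citrate). The following Python sketch performs that arithmetic; the function name `ssc_molarity` is illustrative only.

```python
# 1X SSC composition as stated in the text, in mol/L.
SSC_1X = {"NaCl": 0.15, "Na citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Return the molar concentration of each solute in `fold`-X SSC."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    # Concentrations scale linearly with the SSC multiple.
    return {solute: round(conc * fold, 6) for solute, conc in SSC_1X.items()}


# SSC multiples mentioned in the text.
for fold in (0.2, 0.5, 2, 6):
    print(f"{fold}X SSC:", ssc_molarity(fold))
```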

The terms peptides, proteins, and polypeptides are used interchangeably herein. 

As used herein, the term "surface protein" refers to all surface accessible
proteins, e.g., inner and outer membrane proteins, proteins adhering to the cell wall, and
secreted proteins.

A polypeptide has H. pylori biological activity if it has one, two and preferably
more of the following properties: (1) when expressed in the course of an H. pylori
infection, it can promote or mediate the attachment of H. pylori to a cell; (2) it has an
enzymatic activity, structural or regulatory function characteristic of an H. pylori
protein; (3) the gene which encodes it can rescue a lethal mutation in an H. pylori gene;
or (4) it is immunogenic in a subject. A polypeptide has biological activity if it is an
antagonist, agonist, or super-agonist of a polypeptide having one of the above-listed
properties.

A biologically active fragment or analog is one having an in vivo or in vitro
activity which is characteristic of the H. pylori polypeptides of the invention contained
in the Sequence Listing, or of other naturally occurring H. pylori polypeptides, e.g., one
or more of the biological activities described herein. Especially preferred are fragments
which exist in vivo, e.g., fragments which arise from post transcriptional processing or
which arise from translation of alternatively spliced RNAs. Fragments include those
expressed in native or endogenous cells as well as those made in expression systems,
e.g., in CHO cells. Because peptides such as H. pylori polypeptides often exhibit a
range of physiological properties and because such properties may be attributable to
different portions of the molecule, a useful H. pylori fragment or H. pylori analog is one
which exhibits a biological activity in any biological assay for H. pylori activity. Most
preferably the fragment or analog possesses 10%, preferably 40%, more preferably 60%,
70%, 80% or 90% or greater of the activity of H. pylori, in any in vivo or in vitro assay.

Analogs can differ from naturally occurring H. pylori polypeptides in amino acid
sequence or in ways that do not involve sequence, or both. Non-sequence modifications
include changes in acetylation, methylation, phosphorylation, carboxylation, or
glycosylation. Preferred analogs include H. pylori polypeptides (or biologically active
fragments thereof) whose sequences differ from the wild-type sequence by one or more
conservative amino acid substitutions or by one or more non-conservative amino acid
substitutions, deletions, or insertions which do not substantially diminish the biological
activity of the H. pylori polypeptide. Conservative substitutions typically include the
substitution of one amino acid for another with similar characteristics, e.g., substitutions
within the following groups: valine, glycine; glycine, alanine; valine, isoleucine,
leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine,
arginine; and phenylalanine, tyrosine. Other conservative substitutions can be made in
view of the table below.



TABLE 2

CONSERVATIVE AMINO ACID REPLACEMENTS

For Amino Acid   Code   Replace with any of

Alanine          A      D-Ala, Gly, beta-Ala, L-Cys, D-Cys
Arginine         R      D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile,
                        D-Met, D-Ile, Orn, D-Orn
Asparagine       N      D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln
Aspartic Acid    D      D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln
Cysteine         C      D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr
Glutamine        Q      D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp
Glutamic Acid    E      D-Glu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln
Glycine          G      Ala, D-Ala, Pro, D-Pro, β-Ala, Acp
Isoleucine       I      D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met
Leucine          L      D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met
Lysine           K      D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met,
                        Ile, D-Ile, Orn, D-Orn
Methionine       M      D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val
Phenylalanine    F      D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp,
                        trans-3,4, or 5-phenylproline, cis-3,4, or
                        5-phenylproline
Proline          P      D-Pro, L-1-thioazolidine-4-carboxylic acid, D- or
                        L-1-oxazolidine-4-carboxylic acid
Serine           S      D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O),
                        D-Met(O), L-Cys, D-Cys
Threonine        T      D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O),
                        D-Met(O), Val, D-Val
Tyrosine         Y      D-Tyr, Phe, D-Phe, L-Dopa, His, D-His
Valine           V      D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met
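
For illustration, the conservative substitution groups recited in the text preceding Table 2 (valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; phenylalanine, tyrosine) can be encoded as sets of one-letter codes. The following Python sketch, with the hypothetical helper `is_conservative`, tests whether a single substitution falls within one of those groups. It does not cover the additional replacements of Table 2, such as D-amino acids.

```python
# Groups listed in the text, written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"V", "G"},       # valine, glycine
    {"G", "A"},       # glycine, alanine
    {"V", "I", "L"},  # valine, isoleucine, leucine
    {"D", "E"},       # aspartic acid, glutamic acid
    {"N", "Q"},       # asparagine, glutamine
    {"S", "T"},       # serine, threonine
    {"K", "R"},       # lysine, arginine
    {"F", "Y"},       # phenylalanine, tyrosine
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues differ and share at least one listed group."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("D", "E"))  # True
print(is_conservative("D", "K"))  # False
```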



Other analogs within the invention are those with modifications which increase
peptide stability; such analogs may contain, for example, one or more non-peptide bonds
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs
that include residues other than naturally occurring L-amino acids, e.g., D-amino acids
or non-naturally occurring or synthetic amino acids, e.g., β or γ amino acids; and cyclic
analogs.

As used herein, the term "fragment", as applied to an H. pylori analog, will
ordinarily be at least about 20 residues, more typically at least about 40 residues,
preferably at least about 60 residues in length. Fragments of H. pylori polypeptides can
be generated by methods known to those skilled in the art. The ability of a candidate
fragment to exhibit a biological activity of H. pylori polypeptide can be assessed by
methods known to those skilled in the art as described herein. Also included are H.
pylori polypeptides containing residues that are not required for biological activity of the
peptide or that result from alternative mRNA splicing or alternative protein processing
events.

An "immunogenic component" as used herein is a moiety, such as an H. pylori
polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or
cellular immune response in a host animal alone or in combination with an adjuvant.

An "antigenic component" as used herein is a moiety, such as an H. pylori
polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody
with sufficiently high affinity to form a detectable antigen-antibody complex.

As used herein, the term "transgene" means a nucleic acid (encoding, e.g., one or
more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the
transgenic animal or cell into which it is introduced, or, is homologous to an endogenous
gene of the transgenic animal or cell into which it is introduced, but which is designed to
be inserted, or is inserted, into the cell's genome in such a way as to alter the genome of
the cell into which it is inserted (e.g., it is inserted at a location which differs from that
of the natural gene or its insertion results in a knockout). A transgene can include one or
more transcriptional regulatory sequences and any other nucleic acid, such as introns,
that may be necessary for optimal expression of the selected nucleic acid, all operably
linked to the selected nucleic acid, and may include an enhancer sequence.

As used herein, the term "transgenic cell" refers to a cell containing a transgene.

As used herein, a "transgenic animal" is any animal in which one or more, and
preferably essentially all, of the cells of the animal includes a transgene. The transgene
can be introduced into the cell, directly or indirectly by introduction into a precursor of
the cell, by way of deliberate genetic manipulation, such as by a process of
transformation of competent cells or by microinjection or by infection with a
recombinant virus. This molecule may be integrated within a chromosome, or it may be
extrachromosomally replicating DNA.

The term "antibody" as used herein is intended to include fragments thereof 
which are specifically reactive with H. pylori polypeptides. 

As used herein, the term "cell-specific promoter" means a DNA sequence that
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably
linked to the promoter, and which effects expression of the selected DNA sequence in
specific cells of a tissue. The term also covers so-called "leaky" promoters, which
regulate expression of a selected DNA primarily in one tissue, but cause expression in
other tissues as well.

Misexpression, as used herein, refers to a non-wild type pattern of gene
expression. It includes: expression at non-wild type levels, i.e., over or under
expression; a pattern of expression that differs from wild type in terms of the time or
stage at which the gene is expressed, e.g., increased or decreased expression (as
compared with wild type) at a predetermined developmental period or stage; a pattern of
expression that differs from wild type in terms of decreased expression (as compared
with wild type) in a predetermined cell type or tissue type; a pattern of expression that
differs from wild type in terms of the splicing size, amino acid sequence,
post-translational modification, or biological activity of the expressed polypeptide; a pattern of
expression that differs from wild type in terms of the effect of an environmental stimulus
or extracellular stimulus on expression of the gene, e.g., a pattern of increased or
decreased expression (as compared with wild type) in the presence of an increase or
decrease in the strength of the stimulus.

As used herein, "host cells" and other such terms denoting microorganisms or
higher eukaryotic cell lines cultured as unicellular entities refer to cells which can
become or have been used as recipients for a recombinant vector or other transfer DNA,
and include the progeny of the original cell which has been transfected. It is understood
by individuals skilled in the art that the progeny of a single parental cell may not
necessarily be completely identical in genomic or total DNA complement to the original
parent, due to accidental or deliberate mutation.

As used herein, the term "control sequence" refers to a nucleic acid having a base
sequence which is recognized by the host organism to effect the expression of encoded
sequences to which it is ligated. The nature of such control sequences differs
depending upon the host organism; in prokaryotes, such control sequences generally
include a promoter, ribosomal binding site, terminators, and in some cases operators; in
eukaryotes, generally such control sequences include promoters, terminators and, in
some instances, enhancers. The term control sequence is intended to include, at a
minimum, all components whose presence is necessary for expression, and may also
include additional components whose presence is advantageous, for example, leader
sequences.

As used herein, the term "operably linked" refers to sequences joined or ligated
to function in their intended manner. For example, a control sequence is operably linked
to a coding sequence by ligation in such a way that expression of the coding sequence is
achieved under conditions compatible with the control sequence and host cell.

The metabolism of a substance, as used herein, means any aspect of the
expression, function, action, or regulation of the substance. The metabolism of a
substance includes modifications, e.g., covalent or non-covalent modifications, of the
substance. The metabolism of a substance includes modifications, e.g., covalent or
non-covalent modifications, that the substance induces in other substances. The
metabolism of a substance also includes changes in the distribution of the substance.
The metabolism of a substance includes changes the substance induces in the
distribution of other substances.

A "sample" as used herein refers to a biological sample, such as, for example, 
tissue or fluid isolated from an individual (including without limitation plasma, serum,
cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture 
constituents, as well as samples from the environment. 

The practice of the invention will employ, unless otherwise indicated,
conventional techniques of chemistry, molecular biology, microbiology, recombinant
DNA, and immunology, which are within the skill of the art. Such techniques are
explained fully in the literature. See, e.g., Sambrook, Fritsch, and Maniatis, Molecular
Cloning: A Laboratory Manual, 2nd ed. (1989); DNA Cloning, Volumes I and II (D.N.
Glover ed., 1985); Oligonucleotide Synthesis (M.J. Gait ed., 1984); Nucleic Acid
Hybridization (B.D. Hames & S.J. Higgins eds., 1984); the series, Methods in
Enzymology (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and
Grossman, eds.); and PCR: A Practical Approach (McPherson, Quirke, and Taylor, eds.,
1991).


I. Isolation of Nucleic Acids of H. pylori and Uses Therefor 


H. pylori Genomic Sequence 

This invention provides nucleotide sequences of the genome of H. pylori which
thus comprise a DNA sequence library of H. pylori genomic DNA. The detailed
description that follows provides nucleotide sequences of H. pylori, and also describes
how the sequences were obtained and how ORFs and protein-coding sequences were
identified. Also described are methods of using the disclosed H. pylori sequences,
including diagnostic and therapeutic applications. Furthermore, the library can
be used as a database for identification and comparison of medically important
sequences in this and other strains of H. pylori.

To determine the genomic sequence of H. pylori, DNA was isolated from a strain
of H. pylori (ATCC # 55679; deposited by Genome Therapeutics Corporation, 100
Beaver Street, Waltham, MA 02154) and mechanically sheared by nebulization to a
median size of 2 kb. Following size fractionation by gel electrophoresis, the fragments
were blunt-ended, ligated to adapter oligonucleotides, and cloned into each of 20
different pMPX vectors (Rice et al., Abstracts of Meeting of Genome Mapping and
Sequencing, Cold Spring Harbor, NY, 5/11-5/15, 1994, p. 225) to construct a series of
"shotgun" subclone libraries.

DNA sequencing was achieved using multiplex sequencing procedures
essentially as disclosed in Church et al., 1988, Science 240:185, and U.S. Patent Nos.
4,942,124 and 5,149,625. DNA was extracted from pooled cultures and subjected to
chemical or enzymatic sequencing. Sequencing reactions were resolved by
electrophoresis, and the products were transferred and covalently bound to nylon
membranes. Finally, the membranes were sequentially hybridized with a series of
labelled oligonucleotides complementary to "tag" sequences present in the different
shotgun cloning vectors. In this manner, a large number of sequences could be obtained
from a single set of sequencing reactions. The cloning and sequencing procedures are
described in more detail in the Exemplification.

Individual sequence reads obtained in this manner were assembled using the
FALCON™ program (Church et al., 1994, Automated DNA Sequencing and Analysis,
J.C. Venter, ed., Academic Press) and PHRAP (P. Green, Abstracts of DOE Human
Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157). The average
contig length was about 3-4 kb.

A variety of approaches are used to order the contigs so as to obtain a continuous
sequence representing the entire H. pylori genome. Synthetic oligonucleotides are
designed that are complementary to sequences at the end of each contig. These
oligonucleotides may be hybridized to libraries of H. pylori genomic DNA in, for
example, lambda phage vectors or plasmid vectors to identify clones that contain
sequences corresponding to the junctional regions between individual contigs. Such
clones are then used to isolate template DNA and the same oligonucleotides are used as
primers in polymerase chain reaction (PCR) to amplify junctional fragments, the
nucleotide sequence of which is then determined.
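
By way of illustration only, the design of oligonucleotides at the ends of a contig
can be sketched in a few lines of Python. The function names, the 20-nucleotide
default length, and the outward-facing orientation are assumptions made for this
sketch and are not taken from the procedures described herein.

```python
# Illustrative sketch: names, default length and orientation are assumptions.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def contig_end_oligos(contig: str, length: int = 20) -> dict:
    """Return one outward-facing oligonucleotide for each end of a contig.

    "left" is the reverse complement of the first `length` bases, so that it
    primes synthesis outward past the 5' end of the contig. "right" is the
    last `length` bases and primes synthesis outward past the 3' end.
    """
    if len(contig) < 2 * length:
        raise ValueError("contig is too short for two non-overlapping oligos")
    return {
        "left": reverse_complement(contig[:length]),
        "right": contig[-length:],
    }


if __name__ == "__main__":
    demo = "AAAAACCCCC" + "G" * 30 + "TTTTTGGGGG"
    print(contig_end_oligos(demo, length=10))
```

A "right" oligonucleotide from one contig paired with a "left" oligonucleotide from
a neighboring contig would be expected to amplify the junctional fragment between
them only if the two contigs are in fact adjacent and in the assumed orientation.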

The H. pylori sequences were analyzed for the presence of open reading frames
(ORFs) comprising at least 180 nucleotides. Because the ORFs were identified on the
basis of stop-to-stop codon reads, it should be understood that these ORFs may not
correspond to the ORF of a naturally-occurring H. pylori polypeptide. These ORFs may
contain start codons which indicate the initiation of protein synthesis of a
naturally-occurring H. pylori polypeptide. Such start codons within the ORFs provided
herein can be identified by those of ordinary skill in the relevant art, and the resulting
ORF and the encoded H. pylori polypeptide are within the scope of this invention. For
example, within the ORFs a codon such as AUG or GUG (encoding methionine or valine)
which is part of the initiation signal for protein synthesis can be identified and the ORF
modified to correspond to a naturally-occurring H. pylori polypeptide. The predicted
coding regions were defined by evaluating the coding potential of such sequences with
the program GENEMARK™ (Borodovsky and McIninch, 1993, Comp. Chem. 17:123).
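
The stop-to-stop ORF analysis described above can be sketched as follows. This is a
minimal illustration, not the analysis actually performed: the function names are
hypothetical, only the strand supplied is scanned (the other strand would be
handled by passing its reverse complement), and no coding-potential evaluation of
the kind performed by GENEMARK™ is attempted.

```python
# Illustrative sketch: function names are hypothetical.
STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG"}  # DNA equivalents of AUG and GUG


def stop_to_stop_orfs(seq: str, min_len: int = 180) -> list:
    """Return (frame, start, end) for ORFs bounded by stop codons on both sides.

    Coordinates are 0-based and end-exclusive, and exclude both flanking stop
    codons. Only the three forward frames of `seq` are examined.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        orf_start = None  # set once the first stop codon in this frame is seen
        for pos in range(frame, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                if orf_start is not None and pos - orf_start >= min_len:
                    orfs.append((frame, orf_start, pos))
                orf_start = pos + 3
    return orfs


def first_start_codon(seq: str, orf_start: int, orf_end: int):
    """Return the index of the first in-frame ATG or GTG in the ORF, or None."""
    seq = seq.upper()
    for pos in range(orf_start, orf_end, 3):
        if seq[pos:pos + 3] in START_CODONS:
            return pos
    return None


if __name__ == "__main__":
    demo = "TAA" + "GCA" * 2 + "ATG" + "GCA" * 60 + "TGA"
    for frame, start, end in stop_to_stop_orfs(demo):
        print(frame, start, end, first_start_codon(demo, start, end))
```

The first in-frame ATG or GTG is only a candidate initiation codon; as noted above,
the true start of the naturally-occurring polypeptide may lie elsewhere in the ORF.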

Other H. pylori Nucleic Acids 

The nucleic acids of this invention may be obtained directly from the DNA of the
above referenced H. pylori strain by using the polymerase chain reaction (PCR). See
"PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford,
UK, 1991) for details about the PCR. High fidelity PCR can be used to ensure a faithful
DNA copy prior to expression. In addition, the authenticity of amplified products can be
checked by conventional sequencing methods. Clones carrying the desired sequences
described in this invention may also be obtained by screening the libraries by means of
the PCR or by hybridization of synthetic oligonucleotide probes to filter lifts of the
library colonies or plaques as known in the art (see, e.g., Sambrook et al., Molecular
Cloning, A Laboratory Manual 2nd edition, 1989, Cold Spring Harbor Press, NY).

It is also possible to obtain nucleic acids encoding H. pylori polypeptides from a
cDNA library in accordance with protocols herein described. A cDNA encoding an H.
pylori polypeptide can be obtained by isolating total mRNA from an appropriate strain.
Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the
cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using
any one of a number of known techniques. Genes encoding H. pylori polypeptides can
also be cloned using established polymerase chain reaction techniques in accordance
with the nucleotide sequence information provided by the invention. The nucleic acids
of the invention can be DNA or RNA. Preferred nucleic acids of the invention are
contained in the Sequence Listing.

The nucleic acids of the invention can also be chemically synthesized using
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides
are known, including solid-phase synthesis which, like peptide synthesis, has been fully
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S.
Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S.
Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein).

Nucleic acids isolated or synthesized in accordance with features of the present
invention are useful, by way of example, without limitation, as probes, primers, capture
ligands, antisense genes and for developing expression systems for the synthesis of
proteins and peptides corresponding to such sequences. As probes, primers, capture
ligands and antisense agents, the nucleic acid normally consists of all or part
(approximately twenty or more nucleotides for specificity as well as the ability to form
stable hybridization products) of the nucleic acids of the invention contained in the
Sequence Listing. These uses are described in further detail below.

Probes

A nucleic acid isolated or synthesized in accordance with the sequence of the
invention contained in the Sequence Listing can be used as a probe to specifically detect
H. pylori. With the sequence information set forth in the present application, sequences
of twenty or more nucleotides are identified which provide the desired inclusivity and
exclusivity with respect to H. pylori and extraneous nucleic acids likely to be
encountered during hybridization conditions. More preferably, the sequence will
comprise at least twenty to thirty nucleotides to convey stability to the hybridization
product formed between the probe and the intended target molecules.
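
As a rough illustration of the exclusivity criterion, candidate probe sequences can
be screened computationally by discarding any subsequence that also occurs in a
panel of extraneous sequences. The sketch below is an assumption-laden
simplification: the function names are hypothetical, and exact string matching on
both strands is used as a crude stand-in for hybridization specificity, which in
practice also depends on mismatches, stringency, and probe composition.

```python
# Illustrative sketch: exact matching is a crude proxy for hybridization.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def candidate_probes(target: str, background: list, k: int = 20) -> list:
    """Return (position, k-mer) pairs from `target` absent from `background`.

    A k-mer is excluded if it occurs, on either strand, in any background
    sequence. Each distinct k-mer is reported once, at its first position.
    """
    target = target.upper()
    excluded = set()
    for seq in background:
        seq = seq.upper()
        for strand in (seq, reverse_complement(seq)):
            for i in range(len(strand) - k + 1):
                excluded.add(strand[i:i + k])
    probes = []
    seen = set()
    for i in range(len(target) - k + 1):
        kmer = target[i:i + k]
        if kmer not in excluded and kmer not in seen:
            seen.add(kmer)
            probes.append((i, kmer))
    return probes


if __name__ == "__main__":
    print(candidate_probes("ACGTACGTAA", ["ACGTACGT"], k=4))
```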

Sequences larger than 1000 nucleotides in length are difficult to synthesize but
can be generated by recombinant DNA techniques. Individuals skilled in the art will
readily recognize that the nucleic acids, for use as probes, can be provided with a label to
facilitate detection of a hybridization product.

Nucleic acid isolated or synthesized in accordance with the sequence of the
invention contained in the Sequence Listing can also be useful as probes to detect
homologous regions (especially homologous genes) of other Helicobacter species using
appropriate stringency hybridization conditions as described herein.

Capture Ligand

For use as a capture ligand, the nucleic acid selected in the manner described
above with respect to probes can be readily associated with a support. The manner in
which nucleic acid is associated with supports is well known. Nucleic acid having
twenty or more nucleotides in a sequence of the invention contained in the Sequence
Listing has utility to separate H. pylori nucleic acids from each other and from the
nucleic acid of other organisms. Nucleic acid having twenty or more nucleotides in a
sequence of the invention contained in the Sequence Listing can also have utility to
separate other Helicobacter species from each other and from other organisms.
Preferably, the sequence will comprise at least twenty nucleotides to convey stability
to the hybridization product formed between the probe and the intended target molecules.
Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be
generated by recombinant DNA techniques.

Primers

Nucleic acid isolated or synthesized in accordance with the sequences described
herein has utility as primers for the amplification of H. pylori nucleic acid. These
nucleic acids may also have utility as primers for the amplification of nucleic acids in
other Helicobacter species. With respect to polymerase chain reaction (PCR)
techniques, nucleic acid sequences of > 10-15 nucleotides of the invention contained in
the Sequence Listing have utility in conjunction with suitable enzymes and reagents to
create copies of H. pylori nucleic acid. More preferably, the sequence will comprise
twenty or more nucleotides to convey stability to the hybridization product formed
between the primer and the intended target molecules. Binding conditions of primers
greater than 100 nucleotides are more difficult to control to obtain specificity. High
fidelity PCR can be used to ensure a faithful DNA copy prior to expression. In addition,
amplified products can be checked by conventional sequencing methods.
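
The relationship between primer length and hybrid stability can be illustrated with
the widely used Wallace rule of thumb, under which the approximate melting
temperature of a short oligonucleotide is 2 °C for each A or T plus 4 °C for each
G or C. The rule and the function name below are supplied for illustration only;
they are not part of the methods described herein, and the estimate is rough and
becomes unreliable for longer primers.

```python
# Illustrative sketch: the Wallace rule is an approximation for short oligos.
def wallace_tm(oligo: str) -> int:
    """Estimate melting temperature (deg C) as 2*(A+T) + 4*(G+C)."""
    oligo = oligo.upper()
    unexpected = set(oligo) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for primer in ("ACGTACGTAC", "ACGTACGTACGTACGTACGT"):
        print(len(primer), wallace_tm(primer))
```

Under this approximation a 10-mer of balanced composition has an estimated melting
temperature far below typical annealing temperatures, whereas a comparable 20-mer
does not, which is consistent with the stated preference for primers of twenty or
more nucleotides.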

The copies can be used in diagnostic assays to detect specific sequences,
including genes from H. pylori and/or other Helicobacter species. The copies can also
be incorporated into cloning and expression vectors to generate polypeptides
corresponding to the nucleic acid synthesized by PCR, as is described in greater detail
herein.

Antisense

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in 
accordance with the sequences described herein have utility as antisense agents to 
prevent the expression of H. pylori genes. These sequences also have utility as antisense 
agents to prevent expression of genes of other Helicobacter species. 

In one embodiment, nucleic acid or derivatives corresponding to H. pylori
nucleic acids is loaded into a suitable carrier such as a liposome or bacteriophage for
introduction into bacterial cells. For example, a nucleic acid having twenty or more
nucleotides is capable of binding to bacterial nucleic acid or bacterial messenger RNA.
Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide
necessary stability of a hybridization product of non-naturally occurring nucleic acid and
bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence
greater than 1000 nucleotides in length is difficult to synthesize but can be generated by
recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes
are known in the art as exemplified by U.S. Patent 4,241,046 issued December 23, 1980
to Papahadjopoulos et al.
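
As a simple illustration of what "antisense" means at the sequence level, the sketch
below derives a DNA oligonucleotide complementary to a chosen stretch of a messenger
RNA. The function name and the choice of target window are hypothetical, and the
sketch says nothing about delivery, stability, or efficacy of such an agent.

```python
# Illustrative sketch: function name and target window are hypothetical.
PAIRING = {"A": "T", "U": "A", "G": "C", "C": "G"}  # RNA base -> DNA partner


def antisense_oligo(mrna: str, start: int, length: int = 20) -> str:
    """Return a DNA oligo (5'->3') complementary to mrna[start:start+length].

    The mRNA may be given with U or T; it is read 5'->3'. Because the two
    strands of a hybrid are antiparallel, the complement is reversed.
    """
    mrna = mrna.upper().replace("T", "U")
    target = mrna[start:start + length]
    if len(target) < length:
        raise ValueError("target window runs past the end of the mRNA")
    return "".join(PAIRING[base] for base in reversed(target))


if __name__ == "__main__":
    print(antisense_oligo("AUGGCCAUUGUAAUGGGCCGC", start=0, length=20))
```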



WO 98/24475 



PCT/US97/22104 




II. Expression of H. pylori Nucleic Acids

Nucleic acid isolated or synthesized in accordance with the sequences described
herein have utility to generate polypeptides. The nucleic acid of the invention
exemplified in the Sequence Listing or fragments of the nucleic acid encoding active
portions of H. pylori polypeptides can be cloned into suitable vectors or used to isolate
nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and
cloned into a suitable vector.

The function of a specific gene or operon can be ascertained by expression in a
bacterial strain under conditions where the activity of the gene product(s) specified by
the gene or operon in question can be specifically measured. Alternatively, a gene
product may be produced in large quantities in an expressing strain for use as an antigen,
an industrial reagent, for structural studies, etc. This expression can be accomplished in
a mutant strain which lacks the activity of the gene to be tested, or in a strain that does
not produce the same gene product(s). This includes, but is not limited to, other
Helicobacter strains, or other bacterial strains such as E. coli, Nocardia,
Corynebacterium, Campylobacter, and Streptomyces species. In some cases the
expression host will utilize the natural Helicobacter promoter whereas in others, it will
be necessary to drive the gene with a promoter sequence derived from the expressing
organism (e.g., an E. coli beta-galactosidase promoter for expression in E. coli).

To express a gene product using the natural H. pylori promoter, a procedure such
as the following can be used. A restriction fragment containing the gene of interest,
together with its associated natural promoter element and regulatory sequences
(identified using the DNA sequence data) is cloned into an appropriate recombinant
plasmid containing an origin of replication that functions in the host organism and an
appropriate selectable marker. This can be accomplished by a number of procedures
known to those skilled in the art. It is most preferably done by cutting the plasmid and
the fragment to be cloned with the same restriction enzyme to produce compatible ends
that can be ligated to join the two pieces together. The recombinant plasmid is
introduced into the host organism by, for example, electroporation and cells containing
the recombinant plasmid are identified by selection for the marker on the plasmid.
Expression of the desired gene product is detected using an assay specific for that gene
product.

In the case of a gene that requires a different promoter, the body of the gene
(coding sequence) is specifically excised and cloned into an appropriate expression
plasmid. This subcloning can be done by several methods, but is most easily
accomplished by PCR amplification of a specific fragment and ligation into an
expression plasmid after treating the PCR product with a restriction enzyme or
exonuclease to create suitable ends for cloning.

A suitable host cell for expression of a gene can be any procaryotic or eucaryotic
cell. For example, an H. pylori polypeptide can be expressed in bacterial cells such as E.
coli, insect cells (baculovirus), yeast, or mammalian cells such as Chinese hamster ovary
cells (CHO). Other suitable host cells are known to those skilled in the art.

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead
to partial or complete glycosylation and/or formation of relevant inter- or intra-chain
disulfide bonds of a recombinant peptide product. Examples of vectors for expression in
yeast S. cerevisiae include pYepSec1 (Baldari et al., (1987) EMBO J. 6:229-234), pMFa
(Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene
54:113-123), and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors
available for expression of proteins in cultured insect cells (Sf9 cells) include the pAc
series (Smith et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Luckow,
V.A., and Summers, M.D., (1989) Virology 170:31-39). Generally, COS cells
(Gluzman, Y., (1981) Cell 23:175-182) are used in conjunction with such vectors as
pCDM8 (Aruffo, A. and Seed, B., (1987) Proc. Natl. Acad. Sci. USA 84:8573-8577) for
transient amplification/expression in mammalian cells, while CHO (dhfr- Chinese
Hamster Ovary) cells are used with vectors such as pMT2PC (Kaufman et al. (1987),
EMBO J. 6:187-195) for stable amplification/expression in mammalian cells. Vector
DNA can be introduced into mammalian cells via conventional techniques such as
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, or electroporation. Suitable methods for transforming host cells can be
found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold
Spring Harbor Laboratory Press (1989)), and other laboratory textbooks.

Expression in procaryotes is most often carried out in E. coli with either fusion
or non-fusion inducible expression vectors. Fusion vectors usually add a number of
NH2 terminal amino acids to the expressed target gene. These NH2 terminal amino
acids often are referred to as a reporter group. Such reporter groups usually serve two
purposes: 1) to increase the solubility of the target recombinant protein; and 2) to aid in
the purification of the target recombinant protein by acting as a ligand in affinity
purification. Often, in fusion expression vectors, a proteolytic cleavage site is
introduced at the junction of the reporter group and the target recombinant protein to
enable separation of the target recombinant protein from the reporter group subsequent
to purification of the fusion protein. Enzymes used for such cleavage, each with its
cognate recognition sequence, include Factor Xa, thrombin and enterokinase. Typical
fusion expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL
(New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse
glutathione S-transferase, maltose E binding protein, or protein A, respectively, to the
target recombinant protein. A preferred reporter group is poly(His), which may be fused
to the amino or carboxy terminus of the protein and which renders the recombinant
fusion protein easily purifiable by metal chelate chromatography.
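
The arrangement of reporter group, cleavage site, and target protein can be
illustrated at the amino acid sequence level. The sketch below is hypothetical:
the function names and the six-histidine tag length are assumptions, and only two
proteases are modelled, using their commonly cited recognition sequences (Factor Xa,
IEGR; enterokinase, DDDDK), after which each enzyme cuts. Thrombin is omitted
because it cuts within its recognition sequence and so leaves residues on the
target.

```python
# Illustrative sketch: names and tag length are assumptions.
# Each protease listed cuts immediately after the last residue of its site.
CLEAVAGE_SITES = {
    "factor_xa": "IEGR",
    "enterokinase": "DDDDK",
}


def build_fusion(target: str, protease: str = "factor_xa", his_count: int = 6) -> str:
    """Return Met + poly(His) reporter + cleavage site + target protein."""
    reporter = "M" + "H" * his_count
    return reporter + CLEAVAGE_SITES[protease] + target


def cleave(fusion: str, protease: str = "factor_xa") -> tuple:
    """Split a fusion protein after the first occurrence of the cleavage site.

    Returns (reporter_with_site, released_target).
    """
    site = CLEAVAGE_SITES[protease]
    index = fusion.find(site)
    if index == -1:
        raise ValueError("cleavage site not found in fusion protein")
    cut = index + len(site)
    return fusion[:cut], fusion[cut:]


if __name__ == "__main__":
    fusion = build_fusion("MKKISR")
    print(fusion)
    print(cleave(fusion))
```

A real design would also need to confirm that the recognition sequence does not
occur within the target protein itself, since the sketch cuts at the first match.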

Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene
69:301-315) and pET11d (Studier et al., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 60-89). While target
gene expression relies on host RNA polymerase transcription from the hybrid trp-lac
fusion promoter in pTrc, expression of target genes inserted into pET11d relies on
transcription from the T7 gn10-lac 0 fusion promoter mediated by coexpressed viral
RNA polymerase (T7 gn1). This viral polymerase is supplied by host strains
BL21(DE3) or HMS174(DE3) from a resident λ prophage harboring a T7 gn1 under the
transcriptional control of the lacUV5 promoter.

For example, a host cell transfected with a nucleic acid vector directing
expression of a nucleotide sequence encoding an H. pylori polypeptide can be cultured
under appropriate conditions to allow expression of the polypeptide to occur. The
polypeptide may be secreted and isolated from a mixture of cells and medium containing
the peptide. Alternatively, the polypeptide may be retained cytoplasmically and the cells
harvested, lysed and the protein isolated. A cell culture includes host cells, media and
other byproducts. Suitable media for cell culture are well known in the art.
Polypeptides of the invention can be isolated from cell culture medium, host cells, or
both using techniques known in the art for purifying proteins including ion-exchange
chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and
immunoaffinity purification with antibodies specific for such polypeptides.
Additionally, in many situations, polypeptides can be produced by chemical cleavage of
a native protein (e.g., tryptic digestion) and the cleavage products can then be purified
by standard techniques.

In the case of membrane bound proteins, these can be isolated from a host cell by
contacting a membrane-associated protein fraction with a detergent forming a
solubilized complex, where the membrane-associated protein is no longer entirely
embedded in the membrane fraction and is solubilized at least to an extent which allows
it to be chromatographically isolated from the membrane fraction. Several different
criteria are used for choosing a detergent suitable for solubilizing these complexes. For
example, one property considered is the ability of the detergent to solubilize the H.
pylori protein within the membrane fraction with minimal denaturation of the
membrane-associated protein, allowing for the activity or functionality of the
membrane-associated protein to return upon reconstitution of the protein. Another
property considered when selecting the detergent is the critical micelle concentration
(CMC) of the detergent, in that the detergent of choice preferably has a high CMC value
allowing for ease of removal after reconstitution. A third property considered when
selecting a detergent is the hydrophobicity of the detergent. Typically,
membrane-associated proteins are very hydrophobic and therefore detergents which are
also hydrophobic, e.g., the Triton series, would be useful for solubilizing the
hydrophobic proteins. A fourth property important to a detergent can be the capability of
the detergent to remove the H. pylori protein with minimal protein-protein interaction,
facilitating further purification. A fifth property of the detergent which should be
considered is the charge of the detergent. For example, if it is desired to use ion
exchange resins in the purification process then the detergent should preferably be
uncharged. Chromatographic techniques which can be used in the final purification step
are known in the art and include hydrophobic interaction, lectin affinity, ion exchange,
dye affinity and immunoaffinity.

One strategy to maximize recombinant H. pylori peptide expression in E. coli is
to express the protein in a host bacterium with an impaired capacity to proteolytically
cleave the recombinant protein (Gottesman, S., Gene Expression Technology: Methods
in Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy would be to alter the nucleic acid encoding an H. pylori peptide to be inserted
into an expression vector so that the individual codons for each amino acid would be
those preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992)
Nuc. Acids Res. 20:2111-2118). Such alteration of nucleic acids of the invention can be
carried out by standard DNA synthesis techniques.
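
The codon-replacement strategy can be sketched as a back-translation of the amino
acid sequence using a single preferred codon per residue. The table below is an
illustrative assumption based on codons commonly described as frequent in E. coli;
it is not taken from Wada et al. and should be replaced by measured usage data for
any real design. The function name is likewise hypothetical.

```python
# Illustrative sketch: the codon table is an assumption, not published data.
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGT", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",  # stop
}


def back_translate(protein: str) -> str:
    """Encode a protein (one-letter codes, '*' for stop) with preferred codons."""
    protein = protein.upper()
    unknown = set(protein) - set(PREFERRED_CODON)
    if unknown:
        raise ValueError(f"unsupported residues: {sorted(unknown)}")
    return "".join(PREFERRED_CODON[residue] for residue in protein)


if __name__ == "__main__":
    print(back_translate("MKL*"))
```

Using one codon per amino acid is the simplest possible scheme; it changes only the
nucleotide sequence and leaves the encoded polypeptide unchanged.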

The nucleic acids of the invention can also be chemically synthesized using
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides
are known, including solid-phase synthesis which, like peptide synthesis, has been fully
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S.
Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S.
Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein).


III. H. pylori Polypeptides

This invention encompasses isolated H. pylori polypeptides encoded by the
disclosed H. pylori genomic sequences, including the polypeptides of the invention
contained in the Sequence Listing. Polypeptides of the invention are preferably at least
5 amino acid residues in length. Using the DNA sequence information provided herein,
the amino acid sequences of the polypeptides encompassed by the invention can be
deduced using methods well-known in the art. It will be understood that the sequence of
an entire nucleic acid encoding an H. pylori polypeptide can be isolated and identified
based on an ORF that encodes only a fragment of the cognate protein-coding region.
This can be achieved, for example, by using the isolated nucleic acid encoding the ORF,
or fragments thereof, to prime a polymerase chain reaction with genomic H. pylori DNA
as template; this is followed by sequencing the amplified product.

The polypeptides of the invention can be isolated from wild-type or mutant H.
pylori cells or from heterologous organisms or cells (including, but not limited to,
bacteria, yeast, insect, plant and mammalian cells) into which an H. pylori nucleic acid
has been introduced and expressed. In addition, the polypeptides can be part of
recombinant fusion proteins.

H. pylori polypeptides of the invention can be chemically synthesized using
commercially automated procedures such as those referenced herein.

H. pylori polypeptides of the invention are also intended to include chimeric
proteins and truncated proteins as described herein.

Chimeric H. pylori proteins 

H. pylori chimeric polypeptides comprise one or more H. pylori polypeptides
fused together. These combined sequences can be made by combining two or more
genes, or two or more polypeptide encoding sequences, or at least one gene and at least
one polypeptide encoding sequence in tandem, and the subsequent expression of the
encoded proteins by conventional molecular biological techniques. The combined
nucleotide sequences may be composed of a combination of either full length H. pylori
nucleotide sequences or fragments of such sequences, e.g., fragments which contain
immunologically relevant portions of the encoded H. pylori protein. These chimeric H.
pylori proteins then contain the combined or synergistic vaccine potential of each
individual H. pylori protein sequence and can be used in vaccine formulations of the
invention.



Truncated gene expression and protein production 

H. pylori proteins encoded by a given nucleotide sequence can also be used in a
biologically active truncated form. Such truncation can be produced, for example, by
the elimination of the 5' and/or 3' regions of the encoding nucleotide sequence.
These truncations can affect recombinant expression of the encoded protein and/or
subsequent purification of the protein. For example, truncation of a nucleotide sequence
encoding a predicted export sequence of a specific protein may alter expression of the
protein. Alternatively, C-terminal truncation of an H. pylori polypeptide by elimination
of the 3' end of the nucleic acid coding region may also improve protein expression and
subsequent purification and use, as is outlined in Example VIII below. Deletion of
nucleic acid regions encoding internal H. pylori protein regions can also result in
improved protein expression, purification and/or efficacy as a vaccine candidate.

IV. Identification of Nucleic Acids Encoding Vaccine Components and Targets for
Agents Effective Against H. pylori

The disclosed H. pylori genome sequence includes segments that direct the
synthesis of ribonucleic acids and polypeptides, as well as origins of replication,
promoters, other types of regulatory sequences, and intergenic nucleic acids. The
invention encompasses nucleic acids encoding immunogenic components of vaccines
and targets for agents effective against H. pylori. Identification of said immunogenic
components, which involves determining the function of the disclosed sequences, can
be achieved using a variety of approaches. Non-limiting examples of these approaches
are described briefly below.

Homology to known sequences: Computer-assisted comparison of the disclosed
H. pylori sequences with previously reported sequences present in publicly available
databases is useful for identifying functional H. pylori nucleic acid and polypeptide
sequences. It will be understood that protein-coding sequences, for example, may be
compared as a whole, and that a high degree of sequence homology between two
proteins (such as, for example, >80-90%) at the amino acid level indicates that the two
proteins also possess some degree of functional homology, such as, for example, among
enzymes involved in metabolism, DNA synthesis, or cell wall synthesis, and proteins
involved in transport, cell division, etc. In addition, many structural features of
particular protein classes have been identified and correlate with specific consensus
sequences, such as, for example, binding domains for nucleotides, DNA, metal ions, and
other small molecules; sites for covalent modifications such as phosphorylation,
acylation, and the like; sites of protein:protein interactions, etc. These consensus
sequences may be quite short and thus may represent only a fraction of the entire
protein-coding sequence. Identification of such a feature in an H. pylori sequence is
therefore useful in determining the function of the encoded protein and identifying
useful targets of antibacterial drugs.
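
By way of non-limiting illustration, scanning a protein sequence for a short consensus sequence can be sketched as follows. The motif used here, the nucleotide-binding P-loop (Walker A) consensus [AG]-x(4)-G-K-[ST], is chosen only as an example of a nucleotide-binding feature; the function name `find_motif` and the test sequence are hypothetical and are not taken from the Sequence Listing.

```python
import re

# Walker A / P-loop consensus [AG]-x(4)-G-K-[ST], used purely as an example motif.
# The lookahead lets overlapping occurrences be reported as well.
WALKER_A = re.compile(r"(?=([AG].{4}GK[ST]))")

def find_motif(protein, pattern=WALKER_A):
    """Return (1-based start position, matched residues) for every match."""
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(protein.upper())]

# Hypothetical sequence for illustration only (not an H. pylori ORF).
example = "MKTLAGPSGSGKSTLLRAI"
print(find_motif(example))
```

A short consensus of this kind covers only a few residues of the ORF, so a hit would normally be treated as a hint to be weighed together with whole-sequence homology.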

Of particular relevance to the present invention are structural features that are 
common to secretory, transmembrane, and surface proteins, including secretion signal 
peptides and hydrophobic transmembrane domains. H. pylori proteins identified as
containing putative signal sequences and/or transmembrane domains are useful as
immunogenic components of vaccines.

Identification of essential genes: Nucleic acids that encode proteins essential for
growth or viability of H. pylori are preferred drug targets. H. pylori genes can be tested
for their biological relevance to the organism by examining the effect of deleting and/or
disrupting the genes, i.e., by so-called gene "knockout", using techniques known to
those skilled in the relevant art. In this manner, essential genes may be identified.

Strain-specific sequences: Because of the evolutionary relationship between
different H. pylori strains, it is believed that the presently disclosed H. pylori sequences
are useful for identifying, and/or discriminating between, previously known and new H.
pylori strains. It is believed that other H. pylori strains will exhibit at least 70%
sequence homology with the presently disclosed sequence. Systematic and routine
analyses of DNA sequences derived from samples containing H. pylori strains, and
comparison with the present sequence, allow for the identification of sequences that can
be used to discriminate between strains, as well as those that are common to all H. pylori
strains. In one embodiment, the invention provides nucleic acids, including probes, and
peptide and polypeptide sequences that discriminate between different strains of H.
pylori. Strain-specific components can also be identified functionally by their ability to
elicit or react with antibodies that selectively recognize one or more H. pylori strains.

In another embodiment, the invention provides nucleic acids, including probes,
and peptide and polypeptide sequences that are common to all H. pylori strains but are
not found in other bacterial species.

Specific Example: Determination Of Candidate Protein Antigens For Antibody And 
Vaccine Development 

The selection of candidate protein antigens for vaccine development can be
derived from the nucleic acids encoding H. pylori polypeptides. First, the ORFs can be
analyzed for homology to other known exported or membrane proteins and analyzed
using the discriminant analysis described by Klein, et al. (Klein, P., Kanehisa, M., and
DeLisi, C. (1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported
and membrane proteins.

Homology searches can be performed using the BLAST algorithm contained in
the Wisconsin Sequence Analysis Package (Genetics Computer Group, University
Research Park, 575 Science Drive, Madison, WI 53711) to compare each predicted ORF
amino acid sequence with all sequences found in the current GenBank, SWISS-PROT
and PIR databases. BLAST searches for local alignments between the ORF and the
databank sequences and reports a probability score which indicates the probability of
finding this sequence by chance in the database. ORFs with significant homology (e.g.,
probabilities lower than 1 x 10^-6 that the homology is only due to random chance) to
membrane or exported proteins represent protein antigens for vaccine development.
Possible functions can be assigned to H. pylori genes based on sequence homology to
genes cloned in other organisms.
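
By way of non-limiting illustration, the probability cutoff described above can be applied to a table of search results as sketched below. The record layout (`Hit`), the ORF labels and the probability values are hypothetical; only the 1 x 10^-6 threshold is taken from the text, and the sketch does not reproduce the output format of any particular search program.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    orf_id: str         # hypothetical ORF label
    subject: str        # description of the matching database entry
    probability: float  # probability that the match arose by chance

def significant_hits(hits, cutoff=1e-6):
    """Keep hits whose chance probability is below the cutoff, most significant first."""
    kept = [h for h in hits if h.probability < cutoff]
    return sorted(kept, key=lambda h: h.probability)

# Made-up values for illustration only.
hits = [
    Hit("orf0001", "outer membrane protein", 3e-12),
    Hit("orf0002", "hypothetical protein", 0.02),
    Hit("orf0003", "exported lipoprotein", 5e-7),
]
for hit in significant_hits(hits):
    print(hit.orf_id, hit.subject, hit.probability)
```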

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF
amino acid sequences. This algorithm uses the intrinsic information contained in the
ORF amino acid sequence and compares it to information derived from the properties of
known membrane and exported proteins. This comparison predicts which proteins will
be exported, membrane associated or cytoplasmic. ORF amino acid sequences
identified as exported or membrane associated by this algorithm are likely protein
antigens for vaccine development.
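
By way of non-limiting illustration, a much simpler stand-in for such an analysis is a sliding-window hydropathy scan that flags stretches hydrophobic enough to be candidate membrane-spanning segments. The sketch below is not the discriminant analysis of Klein et al.; it assumes the Kyte-Doolittle hydropathy scale with a 19-residue window and a mean cutoff of 1.6, and the function names and test sequences are hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not the Klein et al. method).
KD = {"A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
      "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
      "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2}

def window_means(protein, window=19):
    """Mean hydropathy of every full window; unknown residues (e.g. X) score 0."""
    scores = [KD.get(aa, 0.0) for aa in protein.upper()]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]

def hydrophobic_segments(protein, window=19, threshold=1.6):
    """1-based start positions of windows whose mean hydropathy meets the threshold."""
    return [i + 1 for i, mean in enumerate(window_means(protein, window))
            if mean >= threshold]

# Hypothetical sequences: one with a 19-residue leucine stretch, one fully charged.
print(hydrophobic_segments("MKK" + "L" * 19 + "DDE"))
print(hydrophobic_segments("DEKR" * 10))
```

An ORF with one or more flagged windows would be a candidate membrane-associated protein under these assumptions; a real classification would combine this with signal-sequence and homology evidence.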

Surface exposed outer membrane proteins are likely to represent the best
antigens to provide a protective immune response against H. pylori. Among the
criteria that can be used to aid in prediction of these outer membrane proteins is
the presence of an amphipathic beta-sheet region at their C-terminus. This region, which
has been detected in a large number of outer membrane proteins in Gram negative
bacteria, is often characterized by hydrophobic residues (Phe or Tyr) approximately at
positions 1, 3, 5, 7 and 9 from the C-terminus (e.g., see Figure 1, block F). Importantly,
these sequences have not been detected at the C-termini of periplasmic proteins, thus
allowing preliminary distinction between these classes of proteins based on primary
sequence data. This phenomenon has been reported previously by Struyve et al. (J. Mol.
Biol. 218:141-148, 1991).
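
By way of non-limiting illustration, the C-terminal pattern described above can be checked as sketched below. The positions 1, 3, 5, 7 and 9 and the residues Phe and Tyr come from the text; the requirement that the final residue be Phe or Tyr, the minimum of three matching positions, the function names and the test sequences are assumptions made for the example.

```python
def cterm_aromatic_positions(protein, positions=(1, 3, 5, 7, 9), residues="FY"):
    """Positions counted from the C-terminus (1 = last residue) that hold Phe or Tyr."""
    seq = protein.upper()
    return [p for p in positions if p <= len(seq) and seq[-p] in residues]

def looks_like_omp_cterm(protein, min_hits=3):
    """Heuristic flag (assumed thresholds): last residue is F/Y and at least
    min_hits of the odd positions 1-9 from the C-terminus are F/Y."""
    found = cterm_aromatic_positions(protein)
    return 1 in found and len(found) >= min_hits

# Hypothetical sequences for illustration only.
print(cterm_aromatic_positions("MKKTGSFAYSFNYKF"))
print(looks_like_omp_cterm("MKKTGSAAAAAAAAK"))
```

Because the text describes the positions as approximate, a flag of this kind would serve only as a preliminary screen.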

Also illustrated in Figure 1 are additional amino acid sequence motifs found in
many outer membrane proteins of H. pylori. The amino acid sequence alignment in
Figure 1 depicts portions of the sequence of five H. pylori proteins (depicted in the
single letter amino acid code) labeled with their amino acid Sequence ID Numbers and
shown N-terminal to C-terminal, left to right. Six distinct blocks (labeled A through F)
of similar amino acid residues are found including the distinctive hydrophobic residues
(Phe or Tyr; F or Y according to the single letter code for amino acid residues)
frequently found at positions near the C-terminus of outer membrane proteins. The
presence of several shared motifs clearly establishes the similarity between members of
this group of proteins.

In addition, outer membrane proteins isolated from H. pylori frequently share a
motif near the mature N-terminus (i.e., after processing to remove the secretion signal)
as illustrated in the blocked amino acid residues in Figure 2. Figure 2 depicts the
N-terminal portion of three H. pylori proteins (designated by their amino acid Sequence ID
Numbers and shown N-terminal to C-terminal, left to right).

One skilled in the art would know that these shared sequence motifs are highly
significant and establish a similarity among this group of proteins.

Infrequently, it is not possible to distinguish between multiple possible
nucleotides at a given position in the nucleic acid sequence. In those cases the
ambiguities are denoted by an extended alphabet as follows:

These are the official IUPAC-IUB single-letter base codes:

Code    Base Description
G       Guanine
A       Adenine
T       Thymine
C       Cytosine
R       Purine               (A or G)
Y       Pyrimidine           (C or T or U)
M       Amino                (A or C)
K       Ketone               (G or T)
S       Strong interaction   (C or G)
W       Weak interaction     (A or T)
H       Not-G                (A or C or T)
B       Not-A                (C or G or T)
V       Not-T (not-U)        (A or C or G)
D       Not-C                (A or G or T)
N       Any                  (A or C or G or T)



The amino acid translations of this invention account for the ambiguity in the
nucleic acid sequence by translating the ambiguous codon as the letter "X". In all cases,
the permissible amino acid residues at a position are clear from an examination of the
nucleic acid sequence based on the standard genetic code.
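
By way of non-limiting illustration, the convention described above can be sketched as follows: a codon containing any ambiguity code is written as "X", and the permissible residues are recovered by expanding the code against the standard genetic code. The function names and the test sequence are hypothetical, and the sketch assumes DNA input (T rather than U).

```python
from itertools import product

# IUPAC-IUB base codes from the table above (DNA alphabet only).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT",
         "M": "AC", "K": "GT", "S": "CG", "W": "AT", "H": "ACT",
         "B": "CGT", "V": "ACG", "D": "AGT", "N": "ACGT"}

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def permissible_residues(codon):
    """All residues an ambiguous codon could encode ('*' denotes a stop)."""
    expansions = product(*(IUPAC[base] for base in codon.upper()))
    return sorted({CODON_TABLE["".join(bases)] for bases in expansions})

def translate(dna):
    """Translate in frame 1, writing 'X' for any codon with an ambiguity code."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))

print(translate("ATGGCNTTYTAA"))
print(permissible_residues("RAT"))
```

Note that a codon such as GCN is still written as "X" under this convention even though every expansion encodes alanine; `permissible_residues` shows what the position can actually be.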

V. Production of Fragments and Analogs of H. pylori Nucleic Acids and Polypeptides

Based on the discovery of the H. pylori gene products of the invention provided
in the Sequence Listing, one skilled in the art can alter the disclosed structure (of H.
pylori genes), e.g., by producing fragments or analogs, and test the newly produced
structures for activity. Examples of techniques known to those skilled in the relevant art
which allow the production and testing of fragments and analogs are discussed below.
These, or analogous methods, can be used to make and screen libraries of polypeptides,
e.g., libraries of random peptides or libraries of fragments or analogs of cellular proteins,
for the ability to bind H. pylori polypeptides. Such screens are useful for the
identification of inhibitors of H. pylori.

Generation of Fragments

Fragments of a protein can be produced in several ways, e.g., recombinantly, by
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a
polypeptide can be generated by removing one or more nucleotides from one end (for a
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which
encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide
fragments. Digestion with "end-nibbling" endonucleases can thus generate DNAs
which encode an array of fragments. DNAs which encode fragments of a protein can
also be generated by random shearing, restriction digestion or a combination of the
above-discussed methods.

Fragments can also be chemically synthesized using techniques known in the art
such as conventional Merrifield solid phase Fmoc or t-Boc chemistry. For example,
peptides of the present invention may be arbitrarily divided into fragments of desired
length with no overlap of the fragments, or divided into overlapping fragments of a
desired length.
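
By way of non-limiting illustration, dividing a peptide into non-overlapping or overlapping fragments of a desired length can be sketched as follows. The function name, its parameters and the test sequence are hypothetical.

```python
def fragments(peptide, length, overlap=0):
    """Split a peptide into fragments of `length` residues.

    Successive fragments share `overlap` residues (0 gives no overlap). The final
    fragment may be shorter; a trailing piece that adds no new residue is dropped.
    """
    if not 0 <= overlap < length:
        raise ValueError("overlap must be non-negative and smaller than length")
    step = length - overlap
    return [peptide[i:i + length]
            for i in range(0, len(peptide), step)
            if i == 0 or i + overlap < len(peptide)]

print(fragments("ABCDEFGHIJ", 4))             # no overlap
print(fragments("ABCDEFGHIJ", 4, overlap=2))  # neighbours share two residues
```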


Alteration of Nucleic Acids and Polypeptides: Random Methods 

Amino acid sequence variants of a protein can be prepared by random
mutagenesis of DNA which encodes a protein or a particular domain or region of a
protein. Useful methods include PCR mutagenesis and saturation mutagenesis. A
library of random amino acid sequence variants can also be generated by the synthesis of
a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a
library of variants are described elsewhere herein.)

(A) PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce
random mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15).
The DNA region to be mutagenized is amplified using the polymerase chain
reaction (PCR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA
polymerase, e.g., by using a dGTP/dATP ratio of five and adding Mn2+ to the PCR
reaction. The pool of amplified DNA fragments is inserted into appropriate cloning
vectors to provide random mutant libraries.

(B) Saturation Mutagenesis

Saturation mutagenesis allows for the rapid introduction of a large number of
single base substitutions into cloned DNA fragments (Myers et al., 1985, Science
229:242). This technique includes generation of mutations, e.g., by chemical treatment
or irradiation of single-stranded DNA in vitro, and synthesis of a complementary DNA
strand. The mutation frequency can be modulated by modulating the severity of the
treatment, and essentially all possible base substitutions can be obtained. Because this
procedure does not involve a genetic selection for mutant fragments, both neutral
substitutions and those that alter function are obtained. The distribution of point
mutations is not biased toward conserved sequence elements.
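
By way of non-limiting illustration, the set of "essentially all possible base substitutions" for a cloned fragment can be enumerated as sketched below; a fragment of n bases has 3n single-base substitutions. The sketch only enumerates the theoretical outcomes and does not model the chemical or irradiation treatment; the function name and the test sequence are hypothetical.

```python
def single_base_substitutions(dna):
    """Yield (1-based position, original base, new base, mutant sequence)
    for every possible single-base substitution of the input."""
    dna = dna.upper()
    for i, ref in enumerate(dna):
        for alt in "ACGT":
            if alt != ref:
                yield i + 1, ref, alt, dna[:i] + alt + dna[i + 1:]

# Hypothetical six-base fragment: 6 positions x 3 alternatives = 18 mutants.
mutants = list(single_base_substitutions("ATGGCC"))
print(len(mutants))
print(mutants[0])
```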

(C) Degenerate Oligonucleotides 

A library of homologs can also be generated from a set of degenerate
oligonucleotide sequences. Chemical synthesis of a degenerate sequence can be carried
out in an automatic DNA synthesizer, and the synthetic genes then ligated into an
appropriate expression vector. The synthesis of degenerate oligonucleotides is known in
the art (see, for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981)
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton,
Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323;
Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). Such
techniques have been employed in the directed evolution of other proteins (see, for
example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433;
Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382;
as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815).
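
By way of non-limiting illustration, the number of distinct sequences encoded by a degenerate oligonucleotide, and the sequences themselves, can be computed as sketched below using the IUPAC-IUB base codes. The function names are hypothetical, and the degenerate codon NNK is used only as a familiar example.

```python
from itertools import product
from math import prod

# IUPAC-IUB base codes (DNA alphabet).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT",
         "M": "AC", "K": "GT", "S": "CG", "W": "AT", "H": "ACT",
         "B": "CGT", "V": "ACG", "D": "AGT", "N": "ACGT"}

def library_size(oligo):
    """Number of distinct sequences represented by a degenerate oligonucleotide."""
    return prod(len(IUPAC[base]) for base in oligo.upper())

def expand(oligo):
    """List every sequence represented by a degenerate oligonucleotide."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in oligo.upper()))]

print(library_size("NNK"))      # one degenerate codon
print(library_size("NNK" * 6))  # six degenerate codons in a row
print(expand("RAY"))
```

Computing the size before synthesis is a quick way to check that a design stays within the number of clones that can realistically be screened; `expand` is practical only for small sets.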


Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis 

Non-random, or directed, mutagenesis techniques can be used to provide specific
sequences or mutations in specific regions. These techniques can be used to create
variants which include, e.g., deletions, insertions, or substitutions of residues of the
known amino acid sequence of a protein. The sites for mutation can be modified
individually or in series, e.g., by (1) substituting first with conserved amino acids and
then with more radical choices depending upon results achieved, (2) deleting the target
residue, or (3) inserting residues of the same or a different class adjacent to the located
site, or combinations of options 1-3.






WO 98/24475 



PCT/US97/22104 




(A) Alanine Scanning Mutagenesis 

Alanine scanning mutagenesis is a useful method for identification of certain
residues or regions of the desired protein that are preferred locations or domains for
mutagenesis (Cunningham and Wells, Science 244:1081-1085, 1989). In alanine
scanning, a residue or group of target residues is identified (e.g., charged residues such
as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino
acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect
the interaction of the amino acids with the surrounding aqueous environment in or
outside the cell. Those domains demonstrating functional sensitivity to the substitutions
are then refined by introducing further or other variants at or for the sites of substitution.
Thus, while the site for introducing an amino acid sequence variation is predetermined,
the nature of the mutation per se need not be predetermined. For example, to optimize
the performance of a mutation at a given site, alanine scanning or random mutagenesis
may be conducted at the target codon or region and the expressed desired protein subunit
variants are screened for the optimal combination of desired activity.
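
By way of non-limiting illustration, the set of single alanine substitutions at charged residues can be generated as sketched below. The target residues (Arg, Asp, His, Lys, Glu) are those named above; the function name, the mutation labels and the test sequence are hypothetical.

```python
CHARGED = frozenset("RDHKE")  # Arg, Asp, His, Lys, Glu, as listed above

def alanine_scan(protein, targets=CHARGED):
    """Return {mutation label: variant sequence} with one Ala substitution per variant."""
    protein = protein.upper()
    variants = {}
    for i, residue in enumerate(protein):
        if residue in targets:
            label = f"{residue}{i + 1}A"  # e.g. K2A = Lys at position 2 replaced by Ala
            variants[label] = protein[:i] + "A" + protein[i + 1:]
    return variants

# Hypothetical five-residue peptide for illustration.
for label, variant in alanine_scan("MKDLE").items():
    print(label, variant)
```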

(B) Oligonucleotide-Mediated Mutagenesis

Oligonucleotide-mediated mutagenesis is a useful method for preparing
substitution, deletion, and insertion variants of DNA; see, e.g., Adelman et al. (DNA
2:183, 1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide
encoding a mutation to a DNA template, where the template is the single-stranded form
of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the
desired protein. After hybridization, a DNA polymerase is used to synthesize an entire
second complementary strand of the template that will thus incorporate the
oligonucleotide primer, and will code for the selected alteration in the desired protein
DNA. Generally, oligonucleotides of at least 25 nucleotides in length are used. An
optimal oligonucleotide will have 12 to 15 nucleotides that are completely
complementary to the template on either side of the nucleotide(s) coding for the
mutation. This ensures that the oligonucleotide will hybridize properly to the
single-stranded DNA template molecule. The oligonucleotides are readily synthesized using
techniques known in the art such as that described by Crea et al. (Proc. Natl. Acad. Sci.
USA, 75: 5765 [1978]).
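
By way of non-limiting illustration, a mutagenic oligonucleotide with 12 to 15 perfectly matched nucleotides on either side of the altered bases can be assembled as sketched below. The sketch assumes that the supplied sequence is the strand present in the single-stranded template, so the oligonucleotide is returned as its reverse complement; the function names, coordinates and test sequence are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(dna):
    """Reverse complement of an unambiguous DNA sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]

def mutagenic_oligo(template, start, end, replacement, flank=15):
    """Oligonucleotide that anneals to `template` and replaces template[start:end]
    (0-based, end exclusive) with `replacement`, with `flank` matched bases per side."""
    if not 12 <= flank <= 15:
        raise ValueError("flank is outside the 12 to 15 nucleotide range given in the text")
    if start - flank < 0 or end + flank > len(template):
        raise ValueError("not enough template sequence flanking the mutation site")
    template = template.upper()
    mutated = template[start - flank:start] + replacement.upper() + template[end:end + flank]
    return reverse_complement(mutated)

# Hypothetical 40-base template; bases 19-21 (1-based) are changed to GCC.
template = "ACGT" * 10
oligo = mutagenic_oligo(template, 18, 21, "GCC")
print(len(oligo), oligo)
```

With a three-base change and 15-base flanks the oligonucleotide is 33 nucleotides long, which satisfies the minimum length of 25 mentioned above.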

(C) Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the
technique described by Wells et al. (Gene, 34:315 [1985]). The starting material is a
plasmid (or other vector) which includes the protein subunit DNA to be mutated. The
codon(s) in the protein subunit DNA to be mutated are identified. There must be a
unique restriction endonuclease site on each side of the identified mutation site(s). If no
such restriction sites exist, they may be generated using the above-described
oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations
in the desired protein subunit DNA. After the restriction sites have been introduced into
the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded
oligonucleotide encoding the sequence of the DNA between the restriction sites but
containing the desired mutation(s) is synthesized using standard procedures. The two
strands are synthesized separately and then hybridized together using standard
techniques. This double-stranded oligonucleotide is referred to as the cassette. This
cassette is designed to have 3' and 5' ends that are compatible with the ends of the
linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now
contains the mutated desired protein subunit DNA sequence.

(D) Combinatorial Mutagenesis

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al.,
WO 88/06630). In this method, the amino acid sequences for a group of homologs or
other related proteins are aligned, preferably to promote the highest homology possible.
All of the amino acids which appear at a given position of the aligned sequences can be
selected to create a degenerate set of combinatorial sequences. The variegated library of
variants is generated by combinatorial mutagenesis at the nucleic acid level, and is
encoded by a variegated gene library. For example, a mixture of synthetic
oligonucleotides can be enzymatically ligated into gene sequences such that the
degenerate set of potential sequences is expressible as individual peptides, or
alternatively, as a set of larger fusion proteins containing the set of degenerate
sequences.
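
By way of non-limiting illustration, collecting the amino acids that appear at each position of a set of aligned homologs, and counting the size of the resulting degenerate set, can be sketched as follows. The sketch assumes pre-aligned sequences of equal length without gaps; the function names and the three short sequences are hypothetical.

```python
from math import prod

def residues_by_position(aligned):
    """Distinct residues observed in each column of equal-length aligned sequences."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    return [sorted(set(column)) for column in zip(*aligned)]

def combinatorial_size(aligned):
    """Number of sequences in the degenerate set built from the per-column residues."""
    return prod(len(column) for column in residues_by_position(aligned))

# Hypothetical aligned homologs for illustration.
aligned = ["MKLV", "MRLI", "MKFV"]
print(residues_by_position(aligned))
print(combinatorial_size(aligned))
```

Three input sequences here give a set of eight combinations, which shows how quickly the variegated library grows beyond the sequences that were aligned.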

Other Modifications of H. pylori Nucleic Acids and Polypeptides 

It is possible to modify the structure of an H. pylori polypeptide for such
purposes as increasing solubility or enhancing stability (e.g., shelf life ex vivo and
resistance to proteolytic degradation in vivo). A modified H. pylori protein or peptide
can be produced in which the amino acid sequence has been altered, such as by amino
acid substitution, deletion, or addition as described herein.

An H. pylori peptide can also be modified by substitution of cysteine residues,
preferably with alanine, serine, threonine, leucine or glutamic acid residues, to minimize
dimerization via disulfide linkages. In addition, amino acid side chains of fragments of
the protein of the invention can be chemically modified. Another modification is
cyclization of the peptide.

In order to enhance stability and/or reactivity, an H. pylori polypeptide can be
modified to incorporate one or more polymorphisms in the amino acid sequence of the
protein resulting from any natural allelic variation. Additionally, D-amino acids,
non-natural amino acids, or non-amino acid analogs can be substituted or added to produce a
modified protein within the scope of this invention. Furthermore, an H. pylori
polypeptide can be modified using polyethylene glycol (PEG) according to the method
of A. Sehon and co-workers (Wie et al., supra) to produce a protein conjugated with
PEG. In addition, PEG can be added during chemical synthesis of the protein. Other
modifications of H. pylori proteins include reduction/alkylation (Tarr, Methods of
Protein Microcharacterization, J. E. Silver ed., Humana Press, Clifton NJ 155-194
(1986)); acylation (Tarr, supra); chemical coupling to an appropriate carrier (Mishell and
Shiigi, eds, Selected Methods in Cellular Immunology, WH Freeman, San Francisco, CA
(1980), U.S. Patent 4,939,239); or mild formalin treatment (Marsh, (1971) Int. Arch. of
Allergy and Appl. Immunol. 41: 199-215).

To facilitate purification and potentially increase solubility of an H. pylori
protein or peptide, it is possible to add an amino acid fusion moiety to the peptide
backbone. For example, hexa-histidine can be added to the protein for purification by
immobilized metal ion affinity chromatography (Hochuli, E. et al., (1988)
Bio/Technology 6: 1321-1325). In addition, to facilitate isolation of peptides free of
irrelevant sequences, specific endoprotease cleavage sites can be introduced between the
sequences of the fusion moiety and the peptide.

To potentially aid proper antigen processing of epitopes within an H. pylori
polypeptide, canonical protease sensitive sites can be engineered between regions, each
comprising at least one epitope, via recombinant or synthetic methods. For example,
charged amino acid pairs, such as KK or RR, can be introduced between regions within
a protein or fragment during recombinant construction thereof. The resulting peptide
can be rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes
which would generate portions of the protein containing one or more epitopes. In
addition, such charged amino acid residues can result in an increase in the solubility of
the peptide.

Primary Methods for Screening Polypeptides and Analogs 
Various techniques are known in the art for screening generated mutant gene
products. Techniques for screening large gene libraries often include cloning the gene
library into replicable expression vectors, transforming appropriate cells with the
resulting library of vectors, and expressing the genes under conditions in which
detection of a desired activity, e.g., in this case, binding to H. pylori polypeptide or an
interacting protein, facilitates relatively easy isolation of the vector encoding the gene
whose product was detected. Each of the techniques described below is amenable to
high through-put analysis for screening large numbers of sequences created, e.g., by
random mutagenesis techniques.

(A) Two Hybrid Systems 

Two hybrid assays such as the system described above (as with the other
screening methods described herein) can be used to identify polypeptides, e.g.,
fragments or analogs of a naturally-occurring H. pylori polypeptide, e.g., of cellular
proteins, or of randomly generated polypeptides which bind to an H. pylori protein.
(The H. pylori domain is used as the bait protein and the library of variants is expressed
as fish fusion proteins.) In an analogous fashion, a two hybrid assay (as with the other
screening methods described herein) can be used to find polypeptides which bind an H.
pylori polypeptide.

(B) Display Libraries 

In one approach to screening assays, the candidate peptides are displayed on the
surface of a cell or viral particle, and the ability of particular cells or viral particles to
bind an appropriate receptor protein via the displayed product is detected in a "panning
assay". For example, the gene library can be cloned into the gene for a surface
membrane protein of a bacterial cell, and the resulting fusion protein detected by
panning (Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371;
and Goward et al. (1992) TIBS 18:136-140). In a similar fashion, a detectably labeled
ligand can be used to score for potentially functional peptide homologs. Fluorescently
labeled ligands, e.g., receptors, can be used to detect homologs which retain ligand-binding
activity. The use of fluorescently labeled ligands allows cells to be visually
inspected and separated under a fluorescence microscope, or, where the morphology of
the cell permits, to be separated by a fluorescence-activated cell sorter.

A gene library can be expressed as a fusion protein on the surface of a viral
particle. For instance, in the filamentous phage system, foreign peptide sequences can
be expressed on the surface of infectious phage, thereby conferring two significant
benefits. First, since these phage can be applied to affinity matrices at concentrations
well over 10^ phage per milliliter, a large number of phage can be screened at one time.
Second, since each infectious phage displays a gene product on its surface, if a particular
phage is recovered from an affinity matrix in low yield, the phage can be amplified by
another round of infection. The group of almost identical E. coli filamentous phages
M13, fd, and f1 are most often used in phage display libraries. Either of the phage gIII
or gVIII coat proteins can be used to generate fusion proteins without disrupting the
ultimate packaging of the viral particle. Foreign epitopes can be expressed at the
NH2-terminal end of pIII and phage bearing such epitopes recovered from a large excess of
phage lacking this epitope (Ladner et al. PCT publication WO 90/02909; Garrard et al.,
PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010;
Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628;
and Barbas et al. (1992) PNAS 89:4457-4461).

A common approach uses the maltose receptor of E. coli (the outer membrane
protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037).
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce
peptides fused into one of the extracellular loops of the protein. These peptides are
available for binding to ligands, e.g., to antibodies, and can elicit an immune response
when the cells are administered to animals. Other cell surface proteins, e.g., OmpA
(Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88,
37-45), and PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial
surface structures have served as vehicles for peptide display. Peptides can be fused to
pilin, a protein which polymerizes to form the pilus, a conduit for interbacterial exchange
of genetic information (Thiry et al. (1989) Appl. Environ. Microbiol. 55, 984-993).
Because of its role in interacting with other cells, the pilus provides a useful support for
the presentation of peptides to the extracellular environment. Another large surface
structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of
peptides to the subunit protein flagellin offers a dense array of many peptide copies on
the host cells (Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of
other bacterial species have also served as peptide fusion partners. Examples include the
Staphylococcus protein A and the outer membrane IgA protease of Neisseria (Hansson
et al. (1992) J. Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999).

In the filamentous phage systems and the LamB system described above, the
physical link between the peptide and its encoding DNA occurs by the containment of
the DNA within a particle (cell or phage) that carries the peptide on its surface.
Capturing the peptide captures the particle and the DNA within. An alternative scheme
uses the DNA-binding protein LacI to form a link between peptide and DNA (Cull et al.
(1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the LacI gene
with an oligonucleotide cloning site at its 3'-end. Under the controlled induction by
arabinose, a LacI-peptide fusion protein is produced. This fusion retains the natural
ability of LacI to bind to a short DNA sequence known as the LacO operator (LacO). By
installing two copies of LacO on the expression plasmid, the LacI-peptide fusion binds
tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a
single oligonucleotide sequence and each cell expresses only a single peptide sequence,
the peptides become specifically and stably associated with the DNA sequence that
directed their synthesis. The cells of the library are gently lysed and the peptide-DNA
complexes are exposed to a matrix of immobilized receptor to recover the complexes
containing active peptides. The associated plasmid DNA is then reintroduced into cells
for amplification and DNA sequencing to determine the identity of the peptide ligands.
As a demonstration of the practical utility of the method, a large random library of
dodecapeptides was made and selected on a monoclonal antibody raised against the
opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a
consensus sequence corresponding to a six-residue portion of dynorphin B (Cull et al.
(1992) Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869).

This scheme, sometimes referred to as peptides-on-plasmids, differs in two
important ways from the phage display methods. First, the peptides are attached to the
C-terminus of the fusion protein, resulting in the display of the library members as
peptides having free carboxy termini. Both of the filamentous phage coat proteins, pIII
and pVIII, are anchored to the phage through their C-termini, and the guest peptides are
placed into the outward-extending N-terminal domains. In some designs, the
phage-displayed peptides are presented right at the amino terminus of the fusion protein
(Cwirla, et al. (1990) Proc. Natl. Acad. Sci. U.S.A. 87, 6378-6382). A second difference
is the set of biological biases affecting the population of peptides actually present in the
libraries. The LacI fusion molecules are confined to the cytoplasm of the host cells.
The phage coat fusions are exposed briefly to the cytoplasm during translation but are
rapidly secreted through the inner membrane into the periplasmic compartment,
remaining anchored in the membrane by their C-terminal hydrophobic domains, with the
N-termini, containing the peptides, protruding into the periplasm while awaiting
assembly into phage particles. The peptides in the LacI and phage libraries may differ
significantly as a result of their exposure to different proteolytic activities. The phage
coat proteins require transport across the inner membrane and signal peptidase
processing as a prelude to incorporation into phage. Certain peptides exert a deleterious
effect on these processes and are underrepresented in the libraries (Gallop et al. (1994) J.
Med. Chem. 37(9):1233-1251). These particular biases are not a factor in the LacI
display system.

The number of small peptides available in recombinant random libraries is 
enormous. Libraries of 10?- 10^ independent clones are routinely prepared. Libraries as
large as 10^ recombinants have been created, but this size approaches the practical
limit for clone libraries. This limitation in library size occurs at the step of transforming
the DNA containing randomized segments into the host bacterial cells. To circumvent
this limitation, an in vitro system based on the display of nascent peptides in polysome
complexes has recently been developed. This display library method has the potential of
producing libraries 3-6 orders of magnitude larger than the currently available
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries,
expression of the peptides, and screening are done in an entirely cell-free format.
In one application of this method (Gallop et al. (1994) J. Med. Chem.
37(9):1233-1251), a molecular DNA library encoding 10^12 decapeptides was
constructed and the library expressed in an E. coli S30 in vitro coupled
transcription/translation system. Conditions were chosen to stall the ribosomes on the
mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes
and yielding complexes containing nascent peptides still linked to their encoding RNA.
The polysomes are sufficiently robust to be affinity purified on immobilized receptors in
much the same way as the more conventional recombinant peptide display libraries are
screened. RNA from the bound complexes is recovered, converted to cDNA, and
amplified by PCR to produce a template for the next round of synthesis and screening.
The polysome display method can be coupled to the phage display system. Following
several rounds of screening, cDNA from the enriched pool of polysomes was cloned into
a phagemid vector. This vector serves as both a peptide expression vector, displaying
peptides fused to the coat proteins, and as a DNA sequencing vector for peptide
identification. By expressing the polysome-derived peptides on phage, one can either
continue the affinity selection procedure in this format or assay the peptides on
individual clones for binding activity in a phage ELISA, or for binding specificity in a
competition phage ELISA (Barret et al. (1992) Anal Biochem 204, 357-364). To
identify the sequences of the active peptides, one sequences the DNA produced by the
phagemid host.
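
By way of illustration only, the relationship between the size of such a library and the number of possible decapeptide sequences can be sketched as follows. The sketch assumes 20 amino acids per position and uniform random sampling of sequences (a Poisson approximation); real libraries are biased by codon usage and by the selection effects described above, so the figures are indicative only. The function names are hypothetical.

```python
import math

AMINO_ACIDS = 20  # assumption: all 20 standard residues allowed at each position


def sequence_space(length: int, alphabet: int = AMINO_ACIDS) -> int:
    """Number of distinct peptides of the given length."""
    return alphabet ** length


def expected_coverage(library_size: float, space: int) -> float:
    """Expected fraction of all possible sequences present at least once,
    assuming each library member is drawn uniformly at random (Poisson model)."""
    return 1.0 - math.exp(-library_size / space)


space = sequence_space(10)              # all possible decapeptides
coverage = expected_coverage(1e12, space)  # library of 10^12 members

print(f"possible decapeptides: {space:.3e}")
print(f"expected coverage of a 10^12 library: {coverage:.1%}")
```

Under these assumptions a library of 10^12 members samples only a fraction of the decapeptide sequence space, which is one reason larger cell-free libraries are of interest.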

Secondary Screening of Polypeptides and Analogs

The high through-put assays described above can be followed by secondary
screens in order to identify further biological activities which will, e.g., allow one skilled
in the art to differentiate agonists from antagonists. The type of secondary screen used
will depend on the desired activity that needs to be tested. For example, an assay can be
developed in which the ability to inhibit an interaction between a protein of interest and
its respective ligand can be used to identify antagonists from a group of peptide
fragments isolated through one of the primary screens described above.

Therefore, methods for generating fragments and analogs and testing them for
activity are known in the art. Once the core sequence of interest is identified, it is
routine for one skilled in the art to obtain analogs and fragments.

Peptide Mimetics of H. pylori Polypeptides

The invention also provides for reduction of the protein binding domains of the
subject H. pylori polypeptides to generate mimetics, e.g. peptide or non-peptide agents.
The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand,
e.g., in the case of an H. pylori polypeptide binding to a naturally occurring ligand. The
critical residues of a subject H. pylori polypeptide which are involved in molecular
recognition of a polypeptide can be determined and used to generate H. pylori-derived
peptidomimetics which competitively or noncompetitively inhibit binding of the H.
pylori polypeptide with an interacting polypeptide (see, for example, European patent
applications EP-412,762A and EP-B31,080A).

For example, scanning mutagenesis can be used to map the amino acid residues
of a particular H. pylori polypeptide involved in binding an interacting polypeptide;
peptidomimetic compounds (e.g., diazepine or isoquinoline derivatives) can then be generated
which mimic those residues in binding to an interacting polypeptide, and which
therefore can inhibit binding of an H. pylori polypeptide to an interacting polypeptide
and thereby interfere with the function of the H. pylori polypeptide. For instance,
non-hydrolyzable peptide analogs of such residues can be generated using benzodiazepine
(e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed.,
ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in
Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden,
Netherlands, 1988), substituted gamma lactam rings (Garvey et al. in Peptides: Chemistry
and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988),
keto-methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et
al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide
Symposium) Pierce Chemical Co. Rockland, IL, 1985), β-turn dipeptide cores (Nagai et
al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans
1:1231), and β-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res
Commun 126:419; and Dann et al. (1986) Biochem Biophys Res Commun 134:71).

VI. Vaccine Formulations for H. pylori Nucleic Acids and Polypeptides

This invention also features vaccine compositions or formulations (used
interchangeably herein) for protection against infection by H. pylori or for treatment of
H. pylori infection. As used herein, the term "treatment of H. pylori infection" refers to
therapeutic treatment of an existing or established H. pylori infection. The terms
"protection against H. pylori infection" or "prophylactic treatment" refer to the use of an
H. pylori vaccine formulation for reducing the risk of or preventing an infection in a subject
at risk for H. pylori infection. In one embodiment, the vaccine compositions contain one
or more immunogenic components, such as a surface protein, from H. pylori, or portion
thereof, and a pharmaceutically acceptable carrier. For example, in one embodiment, the
vaccine formulations of the invention contain at least one or a combination of H. pylori
polypeptides or fragments thereof, from the same or different H. pylori antigens. Nucleic
acids and H. pylori polypeptides for use in the vaccine formulations of the invention
include the nucleic acids and polypeptides set forth in the Sequence Listing, preferably
those H. pylori nucleic acids that encode surface proteins and surface proteins or
fragments thereof. For example, a preferred nucleic acid and H. pylori polypeptide for
use in a vaccine composition of the invention is selected from the group of nucleic acids
which encode cell envelope proteins and H. pylori cell envelope proteins as set forth in
Table 1. However, any nucleic acid encoding an immunogenic H. pylori protein and H.
pylori polypeptide, or portion thereof, can be used in the present invention. These
vaccines have therapeutic and/or prophylactic utilities.

One aspect of the invention provides a vaccine composition for protection
against infection by H. pylori which contains at least one immunogenic fragment of an
H. pylori protein and a pharmaceutically acceptable carrier. Preferred fragments include
peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino
acid residues in length, and more preferably about 12-16 amino acid residues in length.
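
As a non-limiting illustration, candidate fragments in the preferred length range can be enumerated from a protein sequence with a sliding window. The sketch below is an assumption-laden example: the function name, the window length of 14 residues, the step of 4 residues, and the placeholder sequence are all hypothetical and are not taken from the Sequence Listing.

```python
def overlapping_peptides(protein: str, length: int = 14, step: int = 4) -> list[str]:
    """Return overlapping peptides of a fixed length taken every `step` residues.

    A trailing stretch shorter than `length` is not returned.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    if len(protein) < length:
        return []
    return [protein[i:i + length] for i in range(0, len(protein) - length + 1, step)]


# Placeholder 30-residue sequence for illustration only (not an H. pylori protein).
example_protein = "ACDEFGHIKLMNPQRSTVWYACDEFGHIKL"

peptides = overlapping_peptides(example_protein, length=14, step=4)
for start, peptide in zip(range(0, len(example_protein), 4), peptides):
    print(start + 1, peptide)  # 1-based start position and fragment
```

Each fragment produced in this way could then be synthesized or expressed and screened as described below; the overlap between neighboring fragments reduces the chance that an epitope is split across a fragment boundary.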

Immunogenic components of the invention can be obtained, for example, by
screening polypeptides recombinantly produced from the corresponding fragment of the
nucleic acid encoding the full-length H. pylori protein. In addition, fragments can be
chemically synthesized using techniques known in the art such as conventional
Merrifield solid phase f-Moc or t-Boc chemistry.

In one embodiment, immunogenic components are identified by the ability of the
peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for
example, T cell proliferation or cytokine secretion, are defined herein as comprising at
least one T cell epitope. T cell epitopes are believed to be involved in initiation and
perpetuation of the immune response to the protein allergen which is responsible for the
clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at
the level of the T helper cell by binding to an appropriate HLA molecule on the surface
of an antigen presenting cell, thereby stimulating the T cell subpopulation with the
relevant T cell receptor for the epitope. These events lead to T cell proliferation,
lymphokine secretion, local inflammatory reactions, recruitment of additional immune
cells to the site of antigen/T cell interaction, and activation of the B cell cascade, leading
to the production of antibodies. A T cell epitope is the basic element, or smallest unit of
recognition by a T cell receptor, where the epitope comprises amino acids essential to
receptor recognition (e.g., approximately 6 or 7 amino acid residues). Amino acid
sequences which mimic those of the T cell epitopes are within the scope of this
invention.

In another embodiment, immunogenic components of the invention are identified
through genomic vaccination. The basic protocol is based on the idea that expression
libraries consisting of all or parts of a pathogen genome, e.g., an H. pylori genome, can
confer protection when used to genetically immunize a host. This expression library
immunization (ELI) is analogous to expression cloning and involves reducing a genomic
expression library of a pathogen, e.g., H. pylori, into plasmids that can act as genetic
vaccines. The plasmids can also be designed to encode genetic adjuvants which can
dramatically stimulate the humoral response. These genetic adjuvants can be introduced
at remote sites and act as well extracellularly as intracellularly.

This is a new approach to vaccine production that has many of the advantages of
live/attenuated pathogens but no risk of infection. An expression library of pathogen
DNA is used to immunize a host, thereby producing the effects of antigen presentation of
a live vaccine without the risk. For example, in the present invention, random fragments
from the H. pylori genome or from cosmid or plasmid clones, as well as PCR products
from genes identified by genomic sequencing, can be used to immunize a host. The
feasibility of this approach has been demonstrated with Mycoplasma pulmonis (Barry et
al., Nature 377:632-635, 1995), where even partial expression libraries of Mycoplasma
pulmonis, a natural pathogen in rodents, provided protection against challenge from the
pathogen.

ELI is a technique that allows for production of a non-infectious multipartite
vaccine, even when little is known about the pathogen's biology, because ELI uses the
immune system to screen candidate genes. Once isolated, these genes can be used as
genetic vaccines or for development of recombinant protein vaccines. Thus, ELI allows
for production of vaccines in a systematic, largely mechanized fashion.

Screening immunogenic components can be accomplished using one or more of
several different assays. For example, in vitro, peptide T cell stimulatory activity is
assayed by contacting a peptide known or suspected of being immunogenic with an
antigen presenting cell which presents appropriate MHC molecules in a T cell culture.
Presentation of an immunogenic H. pylori peptide in association with appropriate MHC
molecules to T cells in conjunction with the necessary costimulation has the effect of
transmitting a signal to the T cell that induces the production of increased levels of
cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be
obtained and assayed for interleukin-2 or other known cytokines. For example, any one
of several conventional assays for interleukin-2 can be employed, such as the assay
described in Proc. Natl. Acad. Sci. USA, 86: 1333 (1989), the pertinent portions of which
are incorporated herein by reference. A kit for an assay for the production of interferon
is also available from Genzyme Corporation (Cambridge, MA).

Alternatively, a common assay for T cell proliferation entails measuring tritiated
thymidine incorporation. The proliferation of T cells can be measured in vitro by
determining the amount of ³H-labeled thymidine incorporated into the replicating DNA
of cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell
division can be quantified.
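
Thymidine incorporation data of this kind are often summarized as a stimulation index, i.e., the ratio of mean counts per minute (cpm) in peptide-stimulated wells to mean cpm in unstimulated control wells. The stimulation index is not defined in this specification; it is introduced here only as an assumed, commonly used summary, and the function name and replicate counts below are hypothetical.

```python
from statistics import mean


def stimulation_index(stimulated_cpm: list[float], control_cpm: list[float]) -> float:
    """Ratio of mean cpm in stimulated wells to mean cpm in control wells."""
    background = mean(control_cpm)
    if background <= 0:
        raise ValueError("control counts must have a positive mean")
    return mean(stimulated_cpm) / background


# Hypothetical triplicate wells (counts per minute), for illustration only.
peptide_wells = [12000.0, 11500.0, 12500.0]
medium_only_wells = [1500.0, 1400.0, 1600.0]

si = stimulation_index(peptide_wells, medium_only_wells)
print(f"stimulation index: {si:.1f}")
```

The threshold at which a peptide is considered stimulatory is an experimental choice and is not fixed by this sketch.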

Vaccine compositions or formulations of the invention containing one or more
immunogenic components (e.g., an H. pylori polypeptide or fragment thereof or nucleic
acid encoding an H. pylori polypeptide or fragment thereof) preferably include a
pharmaceutically acceptable carrier. The term "pharmaceutically acceptable carrier" is
intended to include any and all solvents, dispersion media, coatings, antibacterial and
antifungal agents, isotonic and absorption delaying agents, and the like, compatible with
pharmaceutical administration. Suitable pharmaceutically acceptable carriers include,
for example, one or more of water, saline, phosphate buffered saline, dextrose, glycerol,
ethanol and the like, as well as combinations thereof. Pharmaceutically acceptable
carriers may further comprise minor amounts of auxiliary substances such as wetting or
emulsifying agents, preservatives or buffers, which enhance the shelf life or
effectiveness of the H. pylori nucleic acid or polypeptide. For vaccine formulations of
the invention containing H. pylori polypeptides, the polypeptide is preferably
coadministered with a suitable adjuvant and/or a delivery system described herein.

It will be apparent to those of skill in the art that the therapeutically effective
amount of DNA or protein of this invention will depend, inter alia, upon the
administration schedule, the unit dose of an H. pylori nucleic acid or polypeptide
administered, whether the protein or nucleic acid is administered in combination with
other therapeutic agents, the immune status and health of the patient, and the therapeutic
activity of the particular protein or nucleic acid.

Vaccine formulations are conventionally administered parenterally, e.g., by
injection, either subcutaneously or intramuscularly. Methods for intramuscular
immunization are described by Wolff et al. (1990) Science 247: 1465-1468 and by
Sedegah et al. (1994) Immunology 91: 9866-9870. Other modes of administration
include oral and pulmonary formulations, suppositories, and transdermal applications.
Oral immunization is preferred over parenteral methods for inducing protection against
infection by H. pylori (Czinn et al. (1993) Vaccine 11: 637-642). Oral formulations
include such normally employed excipients as, for example, pharmaceutical grades of
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium
carbonate, and the like.
In one embodiment, the vaccine formulation includes, as a pharmaceutically
acceptable carrier, an adjuvant. Examples of suitable adjuvants for use in the
vaccine formulations of the invention include, but are not limited to, aluminum
hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP);
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP);
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(CGP 19835A, referred to as MTP-PE); RIBI,
which contains three components from bacteria, monophosphoryl lipid A, trehalose
dimycolate, and cell wall skeleton (MPL + TDM + CWS), in a 2% squalene/Tween 80
emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of
cholera toxin, including its B subunit, and/or conjugates or genetically engineered
fusions of the H. pylori polypeptide with cholera toxin or its B subunit,
procholeragenoid, fungal polysaccharides, including schizophyllan, muramyl dipeptide,
muramyl dipeptide derivatives, phorbol esters, labile toxin of E. coli, non-H. pylori
bacterial lysates, block polymers or saponins.
In another embodiment, the vaccine formulation includes, as a pharmaceutically
acceptable carrier, a delivery system. Suitable delivery systems for use in the vaccine
formulations of the invention include biodegradable microcapsules or
immuno-stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered
attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like
particles, e.g., bluetongue. In another embodiment of the invention, the vaccine
formulation includes both a delivery system and an adjuvant.

Delivery systems in humans may include enteric release capsules protecting the
antigen from the acidic environment of the stomach, and including H. pylori polypeptide
in an insoluble form as fusion proteins. Suitable carriers for the vaccines of the
invention are enteric coated capsules and polylactide-glycolide microspheres. Suitable
diluents are 0.2 N NaHCO3 and/or saline.

Vaccines of the invention can be administered as a primary prophylactic agent in
adults or in children, as a secondary prevention, after successful eradication of H. pylori
in an infected host, or as a therapeutic agent with the aim of inducing an immune response in
a susceptible host to prevent infection by H. pylori. The vaccines of the invention are
administered in amounts readily determined by persons of ordinary skill in the art.
Thus, for adults a suitable dosage will be in the range of 10 µg to 10 g, preferably 10 µg
to 100 mg, for example 50 µg to 50 mg. A suitable dosage for adults will also be in the
range of 5 µg to 500 mg. Similar dosage ranges will be applicable for children.

The amount of adjuvant employed will depend on the type of adjuvant used. For
example, when the mucosal adjuvant is cholera toxin, it is suitably used in an amount of
5 µg to 50 µg, for example 10 µg to 35 µg. When used in the form of microcapsules, the
amount used will depend on the amount employed in the matrix of the microcapsule to
achieve the desired dosage. The determination of this amount is within the skill of a
person of ordinary skill in the art.

Those skilled in the art will recognize that the optimal dose may be more or less
depending upon the patient's body weight, disease, the route of administration, and other
factors. Those skilled in the art will also recognize that appropriate dosage levels can be
obtained based on results with known oral vaccines such as, for example, a vaccine
based on an E. coli lysate (6 mg dose daily up to total of 540 mg) and with an
enterotoxigenic E. coli purified antigen (4 doses of 1 mg) (Schulman et al., J. Urol.
150:917-921 (1993); Boedecker et al., American Gastroenterological Assoc. 999:A-222
(1993)). The number of doses will depend upon the disease, the formulation, and
efficacy data from clinical trials. Without intending any limitation as to the course of
treatment, the treatment can be administered over 3 to 8 doses for a primary
immunization schedule over 1 month (Boedeker, American Gastroenterological Assoc.
888:A-222 (1993)).

In a preferred embodiment, a vaccine composition of the invention can be based 
on a killed whole E. coli preparation with an immunogenic fragment of an H. pylori
protein of the invention expressed on its surface or it can be based on an E. coli lysate, 
wherein the killed E. coli acts as a carrier or an adjuvant. 

It will be apparent to those skilled in the art that some of the vaccine
compositions of the invention are useful only for preventing H. pylori infection, some
are useful only for treating H. pylori infection, and some are useful for both preventing
and treating H. pylori infection. In a preferred embodiment, the vaccine composition of
the invention provides protection against H. pylori infection by stimulating humoral
and/or cell-mediated immunity against H. pylori. It should be understood that
amelioration of any of the symptoms of H. pylori infection is a desirable clinical goal,
including a lessening of the dosage of medication used to treat H. pylori-caused disease,
or an increase in the production of antibodies in the serum or mucus of patients.

VII. Antibodies Reactive With H. pylori Polypeptides

The invention also includes antibodies specifically reactive with the subject H.
pylori polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be
made by standard protocols (see, for example, Antibodies: A Laboratory Manual ed. by
Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a
hamster or rabbit can be immunized with an immunogenic form of the peptide.
Techniques for conferring immunogenicity on a protein or peptide include conjugation
to carriers or other techniques well known in the art. An immunogenic portion of the
subject H. pylori polypeptide can be administered in the presence of adjuvant. The
progress of immunization can be monitored by detection of antibody titers in plasma or
serum. Standard ELISA or other immunoassays can be used with the immunogen as
antigen to assess the levels of antibodies.

In a preferred embodiment, the subject antibodies are immunospecific for
antigenic determinants of the H. pylori polypeptides of the invention, e.g. antigenic
determinants of a polypeptide of the invention contained in the Sequence Listing, or a
closely related human or non-human mammalian homolog (e.g., 90% homologous, more
preferably at least 95% homologous). In yet a further preferred embodiment of the
invention, the anti-H. pylori antibodies do not substantially cross react (i.e., react
specifically) with a protein which is, for example, less than 80 percent homologous to a
sequence of the invention contained in the Sequence Listing. By "not substantially cross
react", it is meant that the antibody has a binding affinity for a non-homologous protein
which is less than 10 percent, more preferably less than 5 percent, and even more
preferably less than 1 percent, of the binding affinity for a protein of the invention
contained in the Sequence Listing. In a most preferred embodiment, there is no
crossreactivity between bacterial and mammalian antigens.
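
The percentage thresholds above can be expressed as a simple ratio test. The following sketch is illustrative only: it assumes that binding affinity is reported as an association constant (larger values meaning tighter binding), which the specification does not prescribe, and the function names and example constants are hypothetical.

```python
def relative_affinity(off_target_ka: float, target_ka: float) -> float:
    """Affinity for a non-homologous protein as a fraction of the affinity
    for the protein of the invention (both as association constants)."""
    if target_ka <= 0:
        raise ValueError("target affinity must be positive")
    return off_target_ka / target_ka


def cross_reactivity_tier(off_target_ka: float, target_ka: float) -> str:
    """Classify an antibody against the 10%, 5% and 1% thresholds in the text."""
    ratio = relative_affinity(off_target_ka, target_ka)
    if ratio < 0.01:
        return "even more preferred (<1%)"
    if ratio < 0.05:
        return "more preferred (<5%)"
    if ratio < 0.10:
        return "not substantially cross-reactive (<10%)"
    return "substantially cross-reactive"


# Hypothetical association constants in M^-1, for illustration only.
print(cross_reactivity_tier(off_target_ka=3e7, target_ka=1e9))
```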

The term antibody as used herein is intended to include fragments thereof which
are also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented
using conventional techniques and the fragments screened for utility in the same manner
as described above for whole antibodies. For example, F(ab')2 fragments can be
generated by treating antibody with pepsin. The resulting F(ab')2 fragment can be
treated to reduce disulfide bridges to produce Fab' fragments. The antibody of the
invention is further intended to include bispecific and chimeric molecules having an
anti-H. pylori portion.

Both monoclonal and polyclonal antibodies (Ab) directed against H. pylori
polypeptides or H. pylori polypeptide variants, and antibody fragments such as Fab' and
F(ab')2, can be used to block the action of an H. pylori polypeptide and allow the study of
the role of a particular H. pylori polypeptide of the invention in aberrant or unwanted
intracellular signaling, as well as the normal cellular function of H. pylori, for example by
microinjection of anti-H. pylori polypeptide antibodies of the present invention.

Antibodies which specifically bind H. pylori epitopes can also be used in
immunohistochemical staining of tissue samples in order to evaluate the abundance and
pattern of expression of H. pylori antigens. Anti-H. pylori polypeptide antibodies can be
used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate
H. pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise,
the ability to monitor H. pylori polypeptide levels in an individual can allow
determination of the efficacy of a given treatment regimen for an individual afflicted
with such a disorder. The level of an H. pylori polypeptide can be measured in cells
found in bodily fluid, such as in urine samples, or can be measured in tissue, such as
produced by gastric biopsy. Diagnostic assays using anti-H. pylori antibodies can
include, for example, immunoassays designed to aid in early diagnosis of H. pylori
infections. The present invention can also be used as a method of detecting antibodies
contained in samples from individuals infected by this bacterium using specific H. pylori
antigens.

Another application of anti-H. pylori polypeptide antibodies of the invention is
in the immunological screening of cDNA libraries constructed in expression vectors
such as λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having
coding sequences inserted in the correct reading frame and orientation, can produce
fusion proteins. For instance, λgt11 will produce fusion proteins whose amino termini
consist of β-galactosidase amino acid sequences and whose carboxy termini consist of a
foreign polypeptide. Antigenic epitopes of a subject H. pylori polypeptide can then be
detected with antibodies, as, for example, reacting nitrocellulose filters lifted from
infected plates with anti-H. pylori polypeptide antibodies. Phage, scored by this assay,
can then be isolated from the infected plate. Thus, the presence of H. pylori gene
homologs can be detected and cloned from other species, and alternate isoforms
(including splicing variants) can be detected and cloned.

VIII. Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention

The nucleic acid, polypeptides and antibodies of the invention can be combined
with other reagents and articles to form kits. Kits for diagnostic purposes typically
comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels.
Kits typically comprise other reagents for performing hybridization reactions,
polymerase chain reactions (PCR), or for reconstitution of lyophilized components, such
as aqueous media, salts, buffers, and the like. Kits may also comprise reagents for
sample processing such as detergents, chaotropic salts and the like. Kits may also
comprise immobilization means such as particles, supports, wells, dipsticks and the like.
Kits may also comprise labeling means such as dyes, developing reagents, radioisotopes,
fluorescent agents, luminescent or chemiluminescent agents, enzymes, intercalating
agents and the like. With the nucleic acid and amino acid sequence information provided
herein, individuals skilled in the art can readily assemble kits to serve their particular
purpose. Kits further can include instructions for use.

IX. Drug Screening Assays Using H. pylori Polypeptides 

By making available purified and recombinant H. pylori polypeptides, the
present invention provides assays which can be used to screen for drugs which are either
agonists or antagonists of the normal cellular function, in this case, of the subject H.
pylori polypeptides, or of their role in intracellular signaling. Such inhibitors or
potentiators may be useful as new therapeutic agents to combat H. pylori infections in
humans. A variety of assay formats will suffice and, in light of the present inventions,
will be comprehended by the skilled artisan.

In many drug screening programs which test libraries of compounds and natural
extracts, high throughput assays are desirable in order to maximize the number of
compounds surveyed in a given period of time. Assays which are performed in cell-free
systems, such as may be derived with purified or semi-purified proteins, are often
preferred as "primary" screens in that they can be generated to permit rapid development
and relatively easy detection of an alteration in a molecular target which is mediated by
a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the
test compound can be generally ignored in the in vitro system, the assay instead being
focused primarily on the effect of the drug on the molecular target as may be manifest in
an alteration of binding affinity with other proteins or change in enzymatic properties of
the molecular target. Accordingly, in an exemplary screening assay of the present
invention, the compound of interest is contacted with an isolated and purified H. pylori
polypeptide.

Screening assays can be constructed in vitro with a purified H. pylori
polypeptide or fragment thereof, such as an H. pylori polypeptide having enzymatic
activity, such that the activity of the polypeptide produces a detectable reaction product.
The efficacy of the compound can be assessed by generating dose response curves from
data obtained using various concentrations of the test compound. Moreover, a control
assay can also be performed to provide a baseline for comparison. Suitable products
include those with distinctive absorption, fluorescence, or chemi-luminescence
properties, for example, because detection may be easily automated. A variety of
synthetic or naturally occurring compounds can be tested in the assay to identify those
which inhibit or potentiate the activity of the H. pylori polypeptide. Some of these
active compounds may directly, or with chemical alterations to promote membrane
permeability or solubility, also inhibit or potentiate the same activity (e.g., enzymatic
activity) in whole, live H. pylori cells.
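
As one hedged illustration of how a dose response curve might be summarized, the sketch below estimates the concentration giving half of the control activity (an IC50) by linear interpolation on a log-concentration scale. The specification does not prescribe this or any other analysis method; the function name and the data points are hypothetical, and curve fitting (e.g., a four-parameter logistic model) would be a common alternative.

```python
import math


def ic50_by_interpolation(concentrations, activities, control_activity):
    """Estimate the concentration at which activity falls to 50% of control.

    `concentrations` must be positive and ascending; `activities` are the
    measured activities at those concentrations. Returns None if the
    half-maximal point is not bracketed by the data.
    """
    half = 0.5 * control_activity
    points = list(zip(concentrations, activities))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(points, points[1:]):
        if a_lo >= half >= a_hi and a_lo != a_hi:
            # Fraction of the way from c_lo to c_hi on a log10 scale.
            frac = (a_lo - half) / (a_lo - a_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical inhibitor data: concentration (µM) vs. enzymatic activity (% of control).
conc_um = [0.1, 1.0, 10.0, 100.0]
activity = [95.0, 80.0, 20.0, 5.0]

ic50 = ic50_by_interpolation(conc_um, activity, control_activity=100.0)
print(f"estimated IC50: {ic50:.2f} µM")
```

The control assay mentioned above supplies the `control_activity` baseline against which the test compound is compared.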

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references and published patent 
applications cited throughout this application are hereby incorporated by reference. 



EXEMPLIFICATION

I. Cloning and Sequencing of H. pylori DNA 

H. pylori chromosomal DNA was isolated according to a basic DNA protocol
outlined in Schleif R.F. and Wensink P.C., Practical Methods in Molecular Biology,
p. 98, Springer-Verlag, NY, 1981, with minor modifications. Briefly, cells were
pelleted and resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6), and GES lysis buffer
(5.1 M guanidinium thiocyanate, 0.1 M EDTA, pH 8.0, 0.5% N-laurylsarcosine) was
added. The suspension was chilled and ammonium acetate (NH4Ac) was added to a final
concentration of 2.0 M. DNA was extracted, first with chloroform, then with
phenol-chloroform, and reextracted with chloroform. DNA was precipitated with isopropanol,
washed twice with 70% EtOH, dried and resuspended in TE.

Following isolation, whole genomic H. pylori DNA was nebulized (Bodenteich et
al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994)
to a median size of 2000 bp. After nebulization, the DNA was concentrated and
separated on a standard 1% agarose gel. Several fractions, corresponding to
approximate sizes 900-1300 bp, 1300-1700 bp, 1700-2200 bp, 2200-2700 bp, were
excised from the gel and purified by the GeneClean procedure (Bio101, Inc.).

The purified DNA fragments were then blunt-ended using T4 DNA polymerase.
The healed DNA was then ligated to unique BstXI-linker adapters in 100-1000 fold
molar excess. These linkers are complementary to the BstXI-cut pMPX vectors, while
the overhang is not self-complementary. Therefore, the linkers will not concatemerize,
nor will the cut vector easily religate to itself. The linker-adapted inserts were
separated from the unincorporated linkers on a 1% agarose gel and purified using
GeneClean. The linker-adapted inserts were then ligated to each of the 20 pMPX vectors
to construct a series of "shotgun" subclone libraries. The vectors contain an
out-of-frame lacZ gene at the cloning site which becomes in-frame in the event that an
adapter-dimer is cloned, allowing these to be avoided by their blue color.

All subsequent steps were based on the multiplex DNA sequencing protocols
outlined in Church G.M. and Kieffer-Higgins S., Science 240:185-188, 1988. Only
major modifications to the protocols are highlighted. Briefly, each of the 20 vectors was
then transformed into DH5α competent cells (Gibco/BRL, DH5α transformation
protocol). The libraries were assessed by plating onto antibiotic plates containing
ampicillin, methicillin and IPTG/Xgal. The plates were incubated overnight at 37°C.
Successful transformants were then used for plating of clones and pooling into the
multiplex pools. The clones were picked and pooled into 40 ml growth medium
cultures. The cultures were grown overnight at 37°C. DNA was purified using the
Qiagen Midi-prep kits and Tip-100 columns (Qiagen, Inc.). In this manner, 100 µg of
DNA was obtained per pool. Fifteen 96-well plates of DNA were generated to obtain a
5-10 fold sequence redundancy assuming 250-300 base average read-lengths.

These purified DNA samples were then sequenced using multiplex DNA
sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins
S., Science 240:185-188, 1988) or by Sequitherm (Epicentre Technologies) dideoxy
sequencing protocols. The sequencing reactions were electrophoresed and transferred
onto nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P.
and Church G.M., Methods in Enzymology 218:187-222, 1993) or by electroblotting
(Church, supra). Twenty-four samples were run per gel. Forty-five successful membranes
were produced by chemical sequencing and 8 were produced by dideoxy sequencing. The
DNA was covalently bound to the membranes by exposure to ultraviolet light, and
hybridized with labeled oligonucleotides complementary to tag sequences on the vectors
(Church, supra). The membranes were washed to rinse off non-specifically bound probe,
and exposed to X-ray film to visualize individual sequence ladders. After
autoradiography, the hybridized probe was removed by incubation at 65°C, and the
hybridization cycle repeated with another tag sequence until the membrane had been
probed 38 times for chemical sequencing membranes and 10 times for the dideoxy
sequencing membranes. Thus, each gel produced a large number of films, each containing
new sequencing information. Whenever a new blot was processed, it was initially probed
for an internal standard sequence added to each of the pools.
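
The stated 5-10 fold redundancy can be cross-checked against the membrane and probing counts given above. The following is a rough sketch only. It assumes that every probing of every sample yields one usable read (an upper bound), and it assumes a genome size of about 1.67 Mb, a figure that is not stated in this section.

```python
# Rough cross-check of sequence redundancy from the membrane counts in the text.
SAMPLES_PER_GEL = 24  # "24 samples were run per gel"


def read_count(membranes: int, probings: int, samples: int = SAMPLES_PER_GEL) -> int:
    """Upper bound on reads: one read per sample per probing per membrane."""
    return membranes * samples * probings


def fold_coverage(reads: int, read_length: int, genome_size: int) -> float:
    """Average redundancy = total bases read / genome size."""
    return reads * read_length / genome_size


chemical_reads = read_count(membranes=45, probings=38)
dideoxy_reads = read_count(membranes=8, probings=10)
total_reads = chemical_reads + dideoxy_reads

# Assumption: approximate H. pylori genome size, not taken from this section.
ASSUMED_GENOME_SIZE_BP = 1_670_000

low = fold_coverage(total_reads, 250, ASSUMED_GENOME_SIZE_BP)
high = fold_coverage(total_reads, 300, ASSUMED_GENOME_SIZE_BP)
print(f"{total_reads} reads, about {low:.1f} to {high:.1f} fold coverage")
```

Under these assumptions the estimate comes to roughly 6 to 8 fold, which falls inside the stated range; failed lanes or unreadable ladders would lower it.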

Digital images of the films were generated using a laser-scanning densitometer
(Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on
computer workstations (VaxStation 4000s) using the program REPLICA™ (Church et
al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994).
Image processing included lane straightening, contrast adjustment to smooth out
intensity differences, and resolution enhancement by iterative Gaussian deconvolution.
The sequences were then automatically picked in REPLICA™ and displayed for
interactive proofreading before being stored in a project database. The proofreading was
accomplished by a quick visual scan of the film image followed by mouse clicks on the
bands of the displayed image to modify the base calls. Many of the sequence errors
could be detected and corrected because multiple sequence reads covering the same
portion of the genomic DNA provide adequate sequence redundancy for editing. Each
sequence automatically received an identification number (corresponding to microtiter
plate, probe information, and lane set number). This number serves as a permanent
identifier of the sequence, so it is always possible to identify the origin of any
particular sequence without recourse to a specialized database.
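
The way redundant reads expose base-call errors can be illustrated with a column-wise majority vote over reads that are already aligned. This is a minimal sketch for illustration, not the procedure used by REPLICA™; the function name, the "-" marker for uncovered positions and the example reads are all hypothetical.

```python
from collections import Counter


def consensus(aligned_reads, uncovered="-"):
    """Column-wise majority vote over pre-aligned reads.

    Positions where the two most frequent bases tie, or where no read
    covers the column, are reported as 'N' for manual proofreading.
    """
    length = max(len(read) for read in aligned_reads)
    calls = []
    for i in range(length):
        column = [r[i] for r in aligned_reads if i < len(r) and r[i] != uncovered]
        if not column:
            calls.append("N")
            continue
        ranked = Counter(column).most_common(2)
        if len(ranked) == 2 and ranked[0][1] == ranked[1][1]:
            calls.append("N")  # ambiguous column, needs a human decision
        else:
            calls.append(ranked[0][0])
    return "".join(calls)


# Hypothetical reads: the second read carries a miscalled base at position 3.
reads = ["ACGTTGCA", "ACGATGCA", "ACGTTGCA"]
print(consensus(reads))
```

With three-fold coverage the single miscall is outvoted; with only two disagreeing reads the column is flagged rather than guessed.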

Routine assembly of H. pylori sequences was done using the program FALCON
(Church et al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.),
Academic Press, 1994). This program has proven to be fast and reliable for most
sequences. The assembled contigs were displayed using a modified version of
GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al.,
Nucleic Acids Res. 12:387-95, 1984), that interacts with REPLICA™. This provided
an integrated editor that allows multiple sequence gel images to be instantaneously
called up from the REPLICA™ database and displayed to allow rapid scanning of
contigs and proofreading of gel traces where discrepancies occurred between different
sequence reads in the assembly.


II. Identification, cloning and expression of recombinant H. pylori DNA sequences 

To facilitate the cloning, expression and purification of membrane and secreted
proteins from H. pylori, a powerful gene expression system, the pET System (Novagen),
for cloning and expression of recombinant proteins in E. coli, was selected. Also, a
DNA sequence encoding a peptide tag, the His-Tag, was fused to the 3' end of DNA
sequences of interest in order to facilitate purification of the recombinant protein
products. The 3' end was selected for fusion in order to avoid alteration of any 5'
terminal signal sequence. The exception to the above was ppiB, a gene cloned for use as
a control in the expression studies. In this study, the sequence for H. pylori ppiB
contains a DNA sequence encoding a His-Tag fused to the 5' end of the full length gene,
because the protein product of this gene does not contain a signal sequence and is
expressed as a cytosolic protein.

PCR Amplification and cloning of DNA sequences containing ORFs for membrane and
secreted proteins from the J99 Strain of Helicobacter pylori

Sequences chosen (from the list of the DNA sequences of the invention) for
cloning from the J99 strain of H. pylori were prepared for amplification cloning by
polymerase chain reaction (PCR). Synthetic oligonucleotide primers (Table 3) specific
for the 5' and 3' ends of open reading frames (ORFs) were designed and purchased
(GibcoBRL Life Technologies, Gaithersburg, MD, USA). All forward primers (specific
for the 5' end of the sequence) were designed to include an NcoI cloning site at the
extreme 5' terminus, except for HpSeq. 4821082 where NdeI was used. These primers
were designed to permit initiation of protein translation at a methionine residue followed
by a valine residue and the coding sequence for the remainder of the native H. pylori
DNA sequence. An exception is H. pylori sequence 4821082 where the initiator
methionine is immediately followed by the remainder of the native H. pylori DNA
sequence. All reverse primers (specific for the 3' end of any H. pylori ORF) included an
EcoRI site at the extreme 5' terminus to permit cloning of each H. pylori sequence into
the reading frame of the pET-28b. The pET-28b vector provides sequence encoding an
additional 20 carboxy-terminal amino acids (only 19 amino acids in HpSeq. 26380318
and HpSeq. 14640637) including six histidine residues (at the extreme C-terminus),
which comprise the His-Tag. An exception to the above, as noted earlier, is the vector
construction for the ppiB gene. A synthetic oligonucleotide primer specific for the 5'
end of the ppiB gene encoded a BamHI site at its extreme 5' terminus and the primer for
the 3' end of the ppiB gene encoded an XhoI site at its extreme 5' terminus.

TABLE 3

Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences

Protein             Forward primer 5' to 3'                    Reverse primer 5' to 3'

Outer membrane proteins
Protein 16225006    5'-TATACCATGGTGGGCGCTAA-3'                 5'-ATGAATTCGAGTAAGGATTTTTG-3'
                    (SEQ ID NO:195)                            (SEQ ID NO:196)
Protein 26054702    5'-TTAACCATGGTGAAAAGCGATA-3'               5'-TAGAATTCGCATAACGATCAATC-3'
                    (SEQ ID NO:197)                            (SEQ ID NO:198)
Protein 7116626     5'-ATATCCATGGTGAGTTTGATGA-3'               5'-ATGAATTCAATTTTTTA[illegible]GCCA-3'
                    (SEQ ID NO:199)                            (SEQ ID NO:200)
Protein 29479681    5'-AATTCCATGGTGG[illegible]                5'-ATGAATTCTCGAT[illegible]
                    (SEQ ID NO:201)                            (SEQ ID NO:202)
Protein 14640637    5'-AATTCCATGGTG[illegible]-3'              5'-AAGAATTCTCTA[illegible]-3'
                    (SEQ ID NO:203)                            (SEQ ID NO:204)

Periplasmic/Secreted proteins
Protein 30100332    5'-ATTTCCATGGTCATGTCTCATATT-3'             5'-ATGAATTCCATC[illegible]ATTCCAC-3'
                    (SEQ ID NO:205)                            (SEQ ID NO:206)
Protein 4721061     5'-AACCATGGTGATTTTAAGCATTGAAAG-3'          5'-AAGAATTCCAC[illegible]AG-3'
                    (SEQ ID NO:207)                            (SEQ ID NO:208)

Other surface proteins
Protein 4821082     5'-GATCATCCATATGTTATCTTCTAAT-3'            5'-TGAATTCAACCATTTTAACCCTG-3'
                    (SEQ ID NO:209)                            (SEQ ID NO:210)
Protein 978477      5'-TATACCATGGTGAA[illegible]-3'            5'-AGAATTCAATT[illegible]-3'
                    (SEQ ID NO:211)                            (SEQ ID NO:212)

Inner membrane protein
Protein 26380318    5'-TATACCATGGTGATGGACAAACTC-3'             5'-ATGAATTGCCACTTGGGGCGATA-3'
                    (SEQ ID NO:213)                            (SEQ ID NO:214)

Cytoplasmic protein
ppiB                5'-TTATGGATCCAAACCAATTAAAACT-3'            5'-TATCTCGAGTTATAGAGAAGGGC-3'
                    (SEQ ID NO:215)                            (SEQ ID NO:216)
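
The primer design rules described before Table 3 can be checked mechanically against the legible primers. The sketch below is illustrative only: it uses the standard recognition sequences of the named enzymes, includes only primers that are fully legible in Table 3, and relies on the fact that every GTN codon encodes valine. The helper names are hypothetical.

```python
# Standard recognition sequences of the restriction enzymes named in the text.
SITES = {
    "NcoI": "CCATGG",
    "NdeI": "CATATG",
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
}

# (forward primer, expected site, reverse primer, expected site),
# copied from the legible entries of Table 3 as printed.
PRIMERS = {
    "16225006": ("TATACCATGGTGGGCGCTAA", "NcoI", "ATGAATTCGAGTAAGGATTTTTG", "EcoRI"),
    "26054702": ("TTAACCATGGTGAAAAGCGATA", "NcoI", "TAGAATTCGCATAACGATCAATC", "EcoRI"),
    "4821082": ("GATCATCCATATGTTATCTTCTAAT", "NdeI", "TGAATTCAACCATTTTAACCCTG", "EcoRI"),
    "26380318": ("TATACCATGGTGATGGACAAACTC", "NcoI", "ATGAATTGCCACTTGGGGCGATA", "EcoRI"),
    "ppiB": ("TTATGGATCCAAACCAATTAAAACT", "BamHI", "TATCTCGAGTTATAGAGAAGGGC", "XhoI"),
}


def has_site(primer: str, enzyme: str) -> bool:
    """True if the primer contains the enzyme's recognition sequence."""
    return SITES[enzyme] in primer


def codon_after_start(primer: str, enzyme: str):
    """Codon that follows the ATG embedded in an NcoI or NdeI site, else None."""
    site_pos = primer.find(SITES[enzyme])
    if site_pos < 0:
        return None
    atg_pos = primer.find("ATG", site_pos)
    if atg_pos < 0:
        return None
    codon = primer[atg_pos + 3:atg_pos + 6]
    return codon if len(codon) == 3 else None


for name, (fwd, fwd_site, rev, rev_site) in PRIMERS.items():
    print(name, fwd_site, has_site(fwd, fwd_site), rev_site, has_site(rev, rev_site))
```

For the NcoI primers the codon after the initiator ATG starts with GT, which is why translation begins Met-Val, whereas the NdeI primer for 4821082 continues directly into native sequence. As transcribed in Table 3, the reverse primer for 26380318 does not contain GAATTC; this may reflect a transcription error rather than the primer actually used.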



Genomic DNA prepared from the J99 strain of H. pylori (ATCC #55679;
deposited by Genome Therapeutics Corporation, 100 Beaver Street, Waltham, MA
02154) was used as the source of template DNA for PCR amplification reactions
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al.,
eds., 1994). To amplify a DNA sequence containing an H. pylori ORF, genomic DNA
(50 nanograms) was introduced into a reaction vial containing 2 mM MgCl2, 1
micromolar synthetic oligonucleotide primers (forward and reverse primers)
complementary to and flanking a defined H. pylori ORF, 0.2 mM of each
deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and 2.5 units of heat stable
DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) in
a final volume of 100 microliters. The following thermal cycling conditions were used
to obtain amplified DNA products for each ORF using a Perkin Elmer Cetus/GeneAmp
PCR System 9600 thermal cycler:


Protein 26054702, Protein 7116626, Protein 29479681, Protein 30100332, and
Protein 4821082:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Protein 16225006:
Denaturation at 94°C for 2 min,
25 cycles at 95°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reaction was concluded at 72°C for 6 minutes.

Protein 4721061:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 36°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 60°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Protein 26380318:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 38°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 62°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Protein 14640637:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 33°C for 15 sec and 72°C for 1.5 min
30 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Conditions for amplification of H. pylori ppiB:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 32°C for 15 sec and 72°C for 1.5 min
25 cycles at 94°C for 15 sec, 56°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.
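
The cycling programs listed above share one structure: an initial denaturation, in most cases two low-stringency cycles, a block of higher-stringency cycles and a final extension. A minimal sketch of that structure as data follows, with a helper that sums the programmed hold times. The class and function names are hypothetical, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple  # pairs of (temperature in deg C, hold time in seconds)

    def seconds(self) -> int:
        return self.cycles * sum(hold for _, hold in self.steps)


def cycle(anneal_c: int, denature_c: int = 94) -> tuple:
    """One denature/anneal/extend cycle with the hold times used in the text."""
    return ((denature_c, 15), (anneal_c, 15), (72, 90))


def two_stage_program(low_anneal_c: int, high_anneal_c: int, main_cycles: int) -> list:
    return [
        Stage(1, ((94, 120),)),               # initial denaturation, 2 min
        Stage(2, cycle(low_anneal_c)),        # two low-stringency cycles
        Stage(main_cycles, cycle(high_anneal_c)),
        Stage(1, ((72, 360),)),               # final extension, 6 min
    ]


PROGRAMS = {
    "26054702 group": two_stage_program(30, 55, 23),
    "4721061": two_stage_program(36, 60, 23),
    "26380318": two_stage_program(38, 62, 23),
    "14640637": two_stage_program(33, 55, 30),
    "ppiB": two_stage_program(32, 56, 25),
    # Single-stage program, cycling denaturation at 95 deg C as stated in the text.
    "16225006": [
        Stage(1, ((94, 120),)),
        Stage(25, cycle(55, denature_c=95)),
        Stage(1, ((72, 360),)),
    ],
}


def hold_minutes(program: list) -> float:
    """Total programmed hold time in minutes, excluding temperature ramps."""
    return sum(stage.seconds() for stage in program) / 60


for name, program in PROGRAMS.items():
    print(f"{name}: {hold_minutes(program):.0f} min")
```

Keeping the programs as data makes it easy to see that they differ only in annealing temperatures and cycle numbers.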

Upon completion of thermal cycling reactions, each sample of amplified DNA
was washed and purified using the Qiaquick Spin PCR purification kit (Qiagen,
Gaithersburg, MD, USA). All amplified DNA samples were subjected to digestion with
the restriction endonucleases NcoI and EcoRI (New England BioLabs, Beverly, MA,
USA), or in the case of HpSeq. 4821082 (SEQ ID NO: 1309), with NdeI and EcoRI
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al.,
eds., 1994). DNA samples were then subjected to electrophoresis on 1.0% NuSieve
(FMC BioProducts, Rockland, ME, USA) agarose gels. DNA was visualized by
exposure to ethidium bromide and long wave UV irradiation. DNA contained in slices
isolated from the agarose gel was purified using the Bio 101 GeneClean Kit protocol
(Bio 101, Vista, CA, USA).

Cloning of H. pylori DNA sequences into the pET-28b prokaryotic expression vector.

The pET-28b vector was prepared for cloning by digestion with NcoI and EcoRI,
or in the case of H. pylori protein 4821082 with NdeI and EcoRI (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). In the case
of cloning ppiB, the pET-28a vector, which encodes a His-Tag that can be fused to the
5' end of an inserted gene, was used and the cloning site prepared for cloning with the
ppiB gene by digestion with BamHI and XhoI restriction endonucleases.

Following digestion, DNA inserts were cloned (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously
digested pET-28b expression vector, except for the amplified insert for ppiB, which was
cloned into the pET-28a expression vector. Products of the ligation reaction were then
used to transform the BL21 strain of E. coli (Current Protocols in Molecular Biology,
John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below.

Transformation of competent bacteria with recombinant plasmids 

Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), were
transformed with recombinant pET expression plasmids carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation
reaction was mixed with 50 microliters of electrocompetent cells and subjected to a high
voltage pulse, after which samples were incubated in 0.45 milliliters SOC medium (0.5%
yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and
20 mM glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB
agar plates containing 25 microgram/ml kanamycin sulfate for growth overnight.
Transformed colonies of BL21 were then picked and analyzed to evaluate cloned inserts
as described below.

Identification of recombinant pET expression plasmids carrying H. pylori sequences

Individual BL21 clones transformed with recombinant pET-28b-H. pylori ORFs
were analyzed by PCR amplification of the cloned inserts using the same forward and
reverse primers, specific for each H. pylori sequence, that were used in the original PCR
amplification cloning reactions. Successful amplification verified the integration of the
H. pylori sequences in the expression vector (Current Protocols in Molecular Biology,
John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

Isolation and Preparation of plasmid DNA from BL21 transformants

Individual clones of recombinant pET-28b vectors carrying properly cloned H.
pylori ORFs were picked and incubated in 5 ml of LB broth plus 25 microgram/ml
kanamycin sulfate overnight. The following day plasmid DNA was isolated and
purified using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA,
USA).

Expression of recombinant H. pylori sequences in E. coli 

The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174,
HB101, JM109, DH5, etc., for the purpose of cloning or plasmid preparation. Hosts for
expression include E. coli strains containing a chromosomal copy of the gene for T7
RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative
that carries the lacI gene, the lacUV5 promoter and the gene for T7 RNA polymerase.
T7 RNA polymerase is induced by addition of isopropyl-β-D-thiogalactoside (IPTG),
and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying a
T7 promoter and a gene of interest. Strains used include: BL21(DE3) (Studier, F.W.,
Rosenberg, A.H., Dunn, J.J., and Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89).

To express recombinant H. pylori sequences, 50 nanograms of plasmid DNA
isolated as described above was used to transform competent BL21(DE3) bacteria
(provided by Novagen as part of the pET expression system kit) as described above. The
lacZ gene (beta-galactosidase) was expressed in the pET-System as described for the H.
pylori recombinant constructions. Transformed cells were cultured in SOC medium for
1 hour, and the culture was then plated on LB plates containing 25 micrograms/ml
kanamycin sulfate. The following day, bacterial colonies were pooled and grown in LB
medium containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600
nm of 0.5 to 1.0 O.D. units, at which point 1 millimolar IPTG was added to the culture
for 3 hours to induce gene expression of the H. pylori recombinant DNA constructions.

After induction of gene expression with IPTG, bacteria were pelleted by
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets
were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1
mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4°C.
Wet pellets were weighed and frozen at -80°C until ready for protein purification.

III. Purification of recombinant proteins from E. coli 
Analytical Methods 

The concentrations of purified protein preparations were quantified
spectrophotometrically using absorbance coefficients calculated from amino acid
content (Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations were
also measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254,
and Lowry, O.H., Rosebrough, N., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193,
pages 265-275, using bovine serum albumin as a standard.
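
The general idea of estimating protein concentration from an absorbance coefficient calculated from amino acid content can be sketched as follows. This is not the Perkins (1986) parameter set cited above; as a stand-in, the sketch uses the widely quoted 280 nm residue values of Pace et al. (1995), and the protein in the example is hypothetical.

```python
# Stand-in residue contributions at 280 nm (Pace et al., 1995), in M^-1 cm^-1.
# The text cites Perkins (1986), whose values are not reproduced here.
EPS_TRP = 5500
EPS_TYR = 1490
EPS_CYSTINE = 125


def molar_extinction(n_trp: int, n_tyr: int, n_cystine: int = 0) -> int:
    """Predicted molar absorbance coefficient at 280 nm."""
    return EPS_TRP * n_trp + EPS_TYR * n_tyr + EPS_CYSTINE * n_cystine


def concentration_mg_per_ml(a280: float, epsilon: float, mw_da: float,
                            path_cm: float = 1.0) -> float:
    """Beer-Lambert law: c = A / (epsilon * l), converted from mol/L to mg/ml."""
    if epsilon <= 0:
        raise ValueError("protein has no Trp, Tyr or cystine; A280 is not usable")
    molar = a280 / (epsilon * path_cm)
    return molar * mw_da  # mol/L x g/mol = g/L = mg/ml


# Hypothetical protein: 1 Trp, 2 Tyr, no disulfides, 20 kDa, A280 = 0.848.
eps = molar_extinction(n_trp=1, n_tyr=2)
conc = concentration_mg_per_ml(a280=0.848, epsilon=eps, mw_da=20_000)
print(eps, round(conc, 2))
```

Because the coefficient depends only on composition, the calculation needs no protein standard, which is why it complements the Bradford and Lowry assays referenced against bovine serum albumin.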

SDS-polyacrylamide gels (12% or 4.0 to 25% acrylamide gradient gels) were
purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue.
Molecular weight markers included rabbit skeletal muscle myosin (200 kDa), E. coli
β-galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum
albumin (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (31 kDa), soybean
trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5
kDa).

1. Purification of soluble proteins

All steps were carried out at 4°C. Frozen cells were thawed, resuspended in 5
volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with 10%
glycerol, 0.1% 2-mercaptoethanol, 200 µg/ml lysozyme, 1 mM phenylmethylsulfonyl
fluoride (PMSF), and 10 µg/ml each of leupeptin, aprotinin, pepstatin,
L-1-chloro-3-[4-tosylamido]-7-amino-2-heptanone (TLCK),
L-1-chloro-3-[4-tosylamido]-4-phenyl-2-butanone (TPCK), and soybean trypsin
inhibitor), and ruptured by several passages through a small volume microfluidizer
(Model M-110S, Microfluidics International Corporation, Newton, MA). The resultant
homogenate was made 0.1% Brij 35, and centrifuged at 100,000 x g for 1 hour to yield a
clear supernatant (crude extract).

Following filtration through a 0.8 µm Supor filter (Gelman Sciences, FRG), the
crude extract was loaded directly onto a Ni2+-nitrilotriacetate-agarose (NTA) column
with a 5 milliliter bed volume (Hochuli, E., Döbeli, H., and Schacher, A. (1987) J.
Chromatography 411, 177-184) pre-equilibrated in lysis buffer containing 10%
glycerol, 0.1% Brij 35 and 1 mM PMSF. The column was washed with 250 ml (50 bed
volumes) of lysis buffer containing 10% glycerol, 0.1% Brij 35, and was eluted with
sequential steps of lysis buffer containing 10% glycerol, 0.05% Brij 35, 1 mM PMSF,
and 20, 100, 200, and 500 mM imidazole in succession. Fractions were monitored by
absorbance at OD280 nm, and peak fractions were analyzed by SDS-PAGE. Fractions
containing the recombinant protein eluted at 100 mM imidazole.

Recombinant protein 14640637 and proteins beta-galactosidase (lacZ) and
peptidyl-prolyl cis-trans isomerase (ppiB)

Fractions containing the recombinant proteins from the Ni2+-NTA-agarose
columns were pooled and then concentrated to approximately 5 ml by centrifugal
filtration (Centriprep-10, Amicon, MA), and loaded directly onto a 180-ml column (1.6
x 91 cm) of Sephacryl S-100 HR gel filtration medium equilibrated in Buffer A (10 mM
Hepes, pH 7.5, 150 mM NaCl, 0.1 mM EGTA) and run in Buffer A at 18 ml/h.
Fractions containing the recombinant protein were identified by absorbance at 280 nm
and analyzed by SDS-PAGE. Fractions were pooled and concentrated by centrifugal
filtration.
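
The gel filtration column dimensions and flow rate given above can be related to bed volume and linear flow velocity with simple cylinder geometry. The sketch below is illustrative only; the function names are hypothetical and the packed bed is treated as an ideal cylinder.

```python
import math


def bed_volume_ml(diameter_cm: float, length_cm: float) -> float:
    """Volume of a cylindrical column bed; 1 cm^3 equals 1 ml."""
    return math.pi * (diameter_cm / 2) ** 2 * length_cm


def linear_velocity_cm_per_h(flow_ml_per_h: float, diameter_cm: float) -> float:
    """Volumetric flow divided by the column cross-sectional area."""
    return flow_ml_per_h / (math.pi * (diameter_cm / 2) ** 2)


s100_volume = bed_volume_ml(1.6, 91)            # stated as a 180-ml column
velocity = linear_velocity_cm_per_h(18, 1.6)    # run at 18 ml/h
hours_per_column_volume = s100_volume / 18
load_fraction = 5 / s100_volume                 # about 5 ml of concentrate loaded

print(f"bed volume {s100_volume:.0f} ml, {velocity:.1f} cm/h, "
      f"{hours_per_column_volume:.1f} h per column volume, "
      f"load {load_fraction:.1%} of bed volume")
```

The computed bed volume of about 183 ml agrees with the stated 180-ml column, and the sample load is under 3% of the bed volume.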

Recombinant protein 7116626

Fractions containing the recombinant protein from the Ni2+-NTA-agarose
column were pooled and dialyzed overnight against 1 liter of dialysis buffer (10 mM
MOPS, pH 6.5, 50 mM NaCl, 0.1 mM EGTA, 0.02% Brij 35 and 1 mM PMSF). In the
morning, a fine white precipitate was removed by centrifugation and the resulting
supernatant was loaded onto an 8 ml (8 x 75 mm) MonoS high performance liquid
chromatography column (Pharmacia Biotechnology, Inc., Piscataway, NJ, USA)
equilibrated in buffer B (10 mM MOPS, pH 6.5, 0.1 mM EGTA) containing 50 mM
NaCl. The column was washed with 10 bed volumes of buffer B containing 50 mM
NaCl, and developed with a 50-ml linear gradient of increasing NaCl (50 to 500 mM).
Recombinant protein 7116626 eluted as a sharp peak at 300 mM NaCl.

2. Purification of insoluble proteins from inclusion bodies

The following steps were carried out at 4°C. Cell pellets were resuspended in
lysis buffer with 10% glycerol, 200 µg/ml lysozyme, 5 mM EDTA, 1 mM PMSF and 0.1%
2-mercaptoethanol. After passage through the cell disrupter, the resulting homogenate
was made 0.2% deoxycholate, stirred 10 minutes, then centrifuged at 20,000 x g for 30
min. The pellets were washed with lysis buffer containing 10% glycerol, 10 mM
EDTA, 1% Triton X-100, 1 mM PMSF and 0.1% 2-mercaptoethanol, followed by several
washes with lysis buffer containing 1 M urea, 1 mM PMSF and 0.1% 2-mercaptoethanol.
The resulting white pellet was composed primarily of inclusion
bodies, free of unbroken cells and membranous materials.

Recombinant proteins 26054702, 16225006, 30100332, 4721061

The following steps were carried out at room temperature. Purified inclusion
bodies were dissolved in 20 ml 8.0 M urea in lysis buffer with 1 mM PMSF and 0.1%
2-mercaptoethanol, and incubated at room temperature for 1 hour. Materials that did not
dissolve were removed by centrifugation. The clear supernatant was filtered, then
loaded onto a Ni2+-NTA agarose column pre-equilibrated in 8.0 M urea in lysis
buffer. The column was washed with 250 ml (50 bed volumes) of lysis buffer
containing 8 M urea, 1.0 mM PMSF and 0.1% 2-mercaptoethanol, and developed with
sequential steps of lysis buffer containing 8 M urea, 1 mM PMSF, 0.1%
2-mercaptoethanol and 20, 100, 200, and 500 mM imidazole in succession. Fractions
were monitored by absorbance at OD280 nm, and peak fractions were analyzed by
SDS-PAGE. Fractions containing the recombinant protein eluted at 100 mM imidazole.

Recombinant proteins 29479681, 26380318

The pellet containing the inclusion bodies was solubilized in buffer B containing
8 M urea, 1 mM PMSF and 0.1% 2-mercaptoethanol, and incubated for 1 hour at room
temperature. Insoluble materials were removed by centrifugation at 20,000 x g for 30
min, and the cleared supernatant was loaded onto a 15 ml (1.6 x 7.5 cm) SP-Sepharose
column pre-equilibrated in buffer B, 6 M urea, 1 mM PMSF, 0.1% 2-mercaptoethanol.
After washing the column with 10 bed volumes, the column was developed with a linear
gradient from 0 to 500 mM NaCl.
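
Both ion-exchange steps in this section use linear NaCl gradients. A minimal sketch of the gradient arithmetic follows, using the MonoS figures given earlier (50 to 500 mM NaCl over 50 ml, elution at 300 mM). It ignores system dead volume and gradient delay, and the function names are hypothetical.

```python
def gradient_concentration(volume_ml: float, start_mm: float, end_mm: float,
                           gradient_ml: float) -> float:
    """Salt concentration (mM) after volume_ml of an ideal linear gradient."""
    if not 0 <= volume_ml <= gradient_ml:
        raise ValueError("volume lies outside the gradient")
    return start_mm + (end_mm - start_mm) * volume_ml / gradient_ml


def volume_at(conc_mm: float, start_mm: float, end_mm: float,
              gradient_ml: float) -> float:
    """Gradient volume (ml) at which a given concentration is reached."""
    if not min(start_mm, end_mm) <= conc_mm <= max(start_mm, end_mm):
        raise ValueError("concentration lies outside the gradient")
    return (conc_mm - start_mm) / (end_mm - start_mm) * gradient_ml


# MonoS step: 50-ml gradient from 50 to 500 mM NaCl, protein eluting at 300 mM.
elution_volume = volume_at(300, start_mm=50, end_mm=500, gradient_ml=50)
midpoint = gradient_concentration(25, start_mm=50, end_mm=500, gradient_ml=50)
print(f"elution after about {elution_volume:.1f} ml; midpoint {midpoint:.0f} mM")
```

On this idealized basis the MonoS peak at 300 mM NaCl corresponds to a little over half of the 50-ml gradient. The SP-Sepharose gradient volume is not stated, so no elution volume is computed for it.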

Dialysis and concentration of protein samples

Urea was removed slowly from the protein samples by dialysis against
Tris-buffered saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCl) containing 0.5%
deoxycholate (DOC) with sequential reduction in urea concentration as follows: 6 M,
4 M, 3 M, 2 M, 1 M, 0.5 M and finally TBS without any urea. Each dialysis step was
conducted for a minimum of 4 hours at room temperature.
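
The stepwise urea removal described above can be written down as a simple schedule, which also gives the minimum total dialysis time. This is a small illustrative sketch; the names are hypothetical.

```python
# Urea concentration (M) of each successive dialysis buffer, as listed in the text.
UREA_STEPS_M = [6, 4, 3, 2, 1, 0.5, 0]
MIN_HOURS_PER_STEP = 4


def minimum_dialysis_hours(steps=UREA_STEPS_M, hours_per_step=MIN_HOURS_PER_STEP):
    """Lower bound on total dialysis time: one minimum-length step per buffer."""
    return len(steps) * hours_per_step


def schedule(steps=UREA_STEPS_M, hours_per_step=MIN_HOURS_PER_STEP):
    """Yield (step number, urea in M, earliest finish time in hours)."""
    for number, urea in enumerate(steps, start=1):
        yield number, urea, number * hours_per_step


for number, urea, finished_by in schedule():
    print(f"step {number}: {urea} M urea, done no earlier than {finished_by} h")
print("minimum total:", minimum_dialysis_hours(), "h")
```

Seven buffer changes of at least 4 hours each mean the refolding dialysis takes a minimum of 28 hours.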

After dialysis, samples were concentrated by pressure filtration using Amicon
stirred-cells. Protein concentrations were measured using the methods of Perkins (1986
Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and
Lowry ((1951) J. Biol. Chem. 193, pages 265-275).

The recombinant proteins purified by the methods described above are 
summarized in Table 4 below. 

TABLE 4

J99          Homolog      Gene        Bacterial cell      Method of                 Relative   Final           Composition
Sequence     identified   symbol of   fraction used to    purification              MW on      concentration   of buffer
Identifier   by Blast     Homolog     purify                                        SDS-PAGE   of purified
                                      recombinant                                   gel        protein
                                      proteins

Outer Membrane Proteins
16225006     P28635       YEAC        Inclusion bodies    His-Tag                   18 kDa     5 mg/ml         B
26054702     P15929       flgH        Inclusion bodies    His-Tag                   37 kDa     1.18 mg/ml      B
                                                                                                               as dry pellet
7116626      P26093       e(P4)       Soluble fraction    His-Tag                   29 kDa     0.8 mg/ml       A
                                                                                               1.85 mg/ml      C
29479681     P13036       fecA        Inclusion bodies    SP-Sepharose              23 kDa     2.36 mg/ml      B
                                                                                               0.5 mg/ml       B
                                                                                                               as dry pellet
14640637     P16665       TPF1        Soluble fraction    His-Tag,                  17 kDa     2.4 mg/ml       A
                                                          gel filtration S100 HR

Periplasmic/Secreted Proteins
30100332     P23847       dppA        Inclusion bodies    His-Tag                   11 kDa     2.88 mg/ml      B
4721061      P36175       GCP         Inclusion bodies    His-Tag                   38 kDa     2.8 mg/ml       B

Other Surface Proteins
4821082      P08089       M protein   Inclusion bodies    His-Tag                   20 kDa     1.16 mg/ml      B
978477       L28919       FBP54       Inclusion bodies    SP-Sepharose              44 kDa     2.56 mg/ml      B
                                                                                               0.3 mg/ml       B

Inner Membrane Proteins
26380318     P15933       fliC        Inclusion bodies    SP-Sepharose              11 kDa     22 mg/ml        B

Control Proteins with His-Tag
             P00722       lacZ        Soluble fraction    His-Tag,                  116 kDa    10 mg/ml        A
                                                          gel filtration S200 HR
                          ppiB        Soluble fraction    His-Tag,                  21 kDa     4.4 mg/ml       A
                                                          gel filtration S100 HR

Buffer compositions:
A = 10 mM Hepes pH 7.5, 150 mM NaCl, 0.1 mM EGTA
B = 10 mM Tris pH 8.0, 150 mM NaCl, 0.5% DOC
C = 10 mM MOPS pH 6.5, 300 mM NaCl, 0.1 mM EGTA







IV. Analysis of H. pylori proteins as Vaccine candidates

To analyze H. pylori proteins for use in the vaccine formulations of the
invention, several H. pylori proteins were expressed, characterized immunologically and
tested in animal efficacy studies as outlined below. Specifically, the immunomodulatory
effects of H. pylori proteins were investigated in a mouse/H. pylori model which mimics
H. pylori infection in humans. In these studies, the effect of oral
immunization with selected H. pylori polypeptides in H. pylori infected mice was
determined.

Identification, cloning and expression of recombinant Helicobacter pylori sequences. 

15 To facilitate the cloning, expression and purification of membrane and/or 

secreted proteins from H. pylori, the pET gene expression system (Novagen), for cloning 
and expression of recombinant proteins in Escherichia coli was selected. Further, for 
proteins that have a signal sequence at their amino-terminal end, a DNA sequence 
encoding a peptide tag (His-tag) was fused to the 5' end of the H. pylori DNA sequences 

of interest in order to facilitate purification of the recombinant protein products.

PCR amplification and cloning of DNA sequences containing ORFs for membrane and
secreted proteins from the J99 strain of Helicobacter pylori.

The sequences selected (from the list of the DNA sequences of the invention) for
cloning from H. pylori strain J99 were prepared for amplification cloning by the
polymerase chain reaction (PCR). All of the selected sequences encode outer
membrane H. pylori proteins, with the vac9 (SEQ ID NO:125), vac10 (SEQ ID NO:147),
vac22 (SEQ ID NO:121) and vac41 (SEQ ID NO:176) sequences all sharing a terminal
phenylalanine residue. Likewise, the vac32 (SEQ ID NO:108), vac36 (SEQ ID NO:149)
and vac37 (SEQ ID NO:139) sequences all share a terminal phenylalanine residue and a
tyrosine cluster at the C-terminus. Synthetic oligonucleotide primers for each ORF of
interest (Table 5), specific for the predicted mature 5' end of the ORF and for the
region downstream (3') of the predicted translational termination codon, were designed
and purchased (GibcoBRL Life Technologies, Gaithersburg, MD, USA). All forward
primers (specific for the 5' terminus of the region of ORF of interest) were designed to
include a BamHI restriction site followed by an NdeI restriction site. These primers
were designed to permit the initiation of protein translation at a methionine residue
encoded within the NdeI restriction site sequence (in the case of producing a non
His-tagged recombinant protein) or to fuse in frame with the DNA sequence encoding
the His-tag (for producing His-tagged recombinant protein), followed by the coding
sequence for the remainder of the native H. pylori DNA. All reverse oligonucleotide
primers (specific for the region downstream (3') of the predicted translational
termination codon of the ORF) were designed to include an EcoRI restriction site at the
5' terminus. This combination of primers would enable each ORF of interest to be
cloned into pET28b (to produce a His-tagged recombinant protein) or pET30a (to
produce a non His-tagged or native recombinant protein). The pET28b vector provides
sequence encoding an additional 20 amino-terminal amino acids (plus the methionine
in the NdeI restriction site), including a stretch of six histidine residues which makes
up the His-tag.
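
The primer design described above can be checked mechanically. The following is a minimal sketch, not part of the original protocol: it scans the Table 5 primer sequences (transcribed here by hand) for the standard recognition sequences of BamHI, NdeI, EcoRI and XhoI. The names SITES, PRIMERS and sites_in are illustrative assumptions.

```python
# Standard recognition sequences of the restriction enzymes of interest.
SITES = {
    "BamHI": "GGATCC",
    "NdeI": "CATATG",
    "EcoRI": "GAATTC",
    "XhoI": "CTCGAG",
}

# (forward, reverse) primers, 5' to 3', transcribed from Table 5.
PRIMERS = {
    "vac9": ("CGCGGATCCATATGGCTGAAAAAACGCCTTTTTTTAAAACTAAAAACCAC",
             "CCGGAATTCATCAGTATTCAATGGGAATAAAGCC"),
    "vac10": ("CGCGGATCCATATGAAAGAAGAAGAAAAAGAAGAAAAAAAGACAGAAAGG",
              "CCGGAATTCGCTTAAAAGAAAATAGTCCCCCAAACGC"),
    "vac22": ("CGCCGGATCCATATGAAAGAGGTCATTCCACCCCTTCAACCCC",
              "CCGGAATTCATATAAATATCATATAGGCAGAAAAAC"),
    "vac32": ("CGCGGATCCATATGGAGGCAGAGCTTGATGAAAAATC",
              "CCGGAATTCGATTGATTTTGTCAAATCTAAAATCCC"),
    "vac36": ("TATTATACATATGGAAGAAGATGGG",
              "TAATCTCGAGTTTAGAAGGCGTA"),
    "vac37": ("TTATATTCATATGGAAGACGATGGC",
              "AATTCTCGAGCCTCTTTATAAGCC"),
    "vac41": ("CGCGGATCCATATGGTAGAAGCCTTTCAAAAACACCAAAAAGACGG",
              "CCGGAATTCGGAGCCAATAGGGAGCTAAAGCC"),
}


def sites_in(primer):
    """Return the enzymes whose site occurs in the primer, ordered by position."""
    hits = [(primer.find(seq), name) for name, seq in SITES.items() if seq in primer]
    return [name for _, name in sorted(hits)]


if __name__ == "__main__":
    for gene, (forward, reverse) in PRIMERS.items():
        print(gene, "forward:", sites_in(forward), "reverse:", sites_in(reverse))
```

Run on the sequences as transcribed, the vac9, vac10, vac22, vac32 and vac41 forward primers show a BamHI site followed by an NdeI site and their reverse primers show an EcoRI site, whereas the vac36 and vac37 primers show an NdeI site (forward) and an XhoI site (reverse).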

Genomic DNA prepared from H. pylori strain J99 (ATCC 55679) was used as
the source of template DNA for the PCR amplification reactions (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To
amplify a DNA sequence containing a specific H. pylori ORF, genomic DNA (50
nanograms) was introduced into a reaction tube containing 200 nanograms of each of
the forward and reverse synthetic oligonucleotide primers specific for the ORF of
interest, and 45 microliters of PCR SuperMix (GibcoBRL Life Technologies,
Gaithersburg, MD, USA) in a total volume of 50 microliters. The PCR SuperMix is
supplied at 1.1X concentration and contains 22 mM Tris-HCl (pH 8.4), 55 mM KCl,
1.65 mM MgCl2, 220 micromolar each of dATP, dCTP, dGTP and dTTP, 22 units
recombinant Taq polymerase/ml and stabilizers. The following thermal cycling
conditions were used to obtain amplified DNA products for each ORF using a Perkin
Elmer Cetus/GeneAmp PCR System thermal cycler.

Table 5: Oligonucleotide primers

Gene / Forward primer / Reverse primer

vac9 (nt SEQ ID NO:28; aa SEQ ID NO:125)
  Forward: CGCGGATCCATATGGCTGAAAAAACGCCTTTTTTTAAAACTAAAAACCAC (SEQ ID NO:257)
  Reverse: CCGGAATTCATCAGTATTCAATGGGAATAAAGCC (SEQ ID NO:258)

vac10 (nt SEQ ID NO:50; aa SEQ ID NO:147)
  Forward: CGCGGATCCATATGAAAGAAGAAGAAAAAGAAGAAAAAAAGACAGAAAGG (SEQ ID NO:259)
  Reverse: CCGGAATTCGCTTAAAAGAAAATAGTCCCCCAAACGC (SEQ ID NO:260)

vac22 (nt SEQ ID NO:24; aa SEQ ID NO:121)
  Forward: CGCCGGATCCATATGAAAGAGGTCATTCCACCCCTTCAACCCC (SEQ ID NO:261)
  Reverse: CCGGAATTCATATAAATATCATATAGGCAGAAAAAC (SEQ ID NO:262)

vac32 (nt SEQ ID NO:11; aa SEQ ID NO:108)
  Forward: CGCGGATCCATATGGAGGCAGAGCTTGATGAAAAATC (SEQ ID NO:263)
  Reverse: CCGGAATTCGATTGATTTTGTCAAATCTAAAATCCC (SEQ ID NO:264)

vac36 (hop B) (nt SEQ ID NO:52; aa SEQ ID NO:149)
  Forward: TATTATACATATGGAAGAAGATGGG (SEQ ID NO:265)
  Reverse: TAATCTCGAGTTTAGAAGGCGTA (SEQ ID NO:266)

vac37 (i-hop) (nt SEQ ID NO:42; aa SEQ ID NO:139)
  Forward: TTATATTCATATGGAAGACGATGGC (SEQ ID NO:267)
  Reverse: AATTCTCGAGCCTCTTTATAAGCC (SEQ ID NO:268)

vac41 (nt SEQ ID NO:79; aa SEQ ID NO:176)
  Forward: CGCGGATCCATATGGTAGAAGCCTTTCAAAAACACCAAAAAGACGG (SEQ ID NO:269)
  Reverse: CCGGAATTCGGAGCCAATAGGGAGCTAAAGCC (SEQ ID NO:270)



Sequences for Vac32, Vac9 and Vac22
Denaturation at 94°C for 30 sec
35 cycles at 94°C for 15 sec, 55°C for 15 sec, and 72°C for 1.5 min
Reactions were concluded at 72°C for 8 minutes

Sequences for Vac10 and Vac41
Denaturation at 94°C for 30 sec
35 cycles at 94°C for 15 sec, 55°C for 15 sec, and 72°C for 2.5 min
Reactions were concluded at 72°C for 8 minutes

Sequences for Vac36 and Vac37
Denaturation at
2 cycles at 94°C for 15 sec, 30°C for 15 sec, and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec, and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes
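
As an illustration only, the three cycling programs above can be written as data and their total programmed hold times summed. This sketch ignores ramp times between temperatures, and it omits the initial denaturation step of the Vac36/Vac37 program because that step is incomplete in the text. The names PROGRAMS and hold_seconds are assumptions made for the example.

```python
# Each program is a list of (cycles, steps); each step is (temperature_c, seconds).
PROGRAMS = {
    "Vac32, Vac9, Vac22": [
        (1, [(94, 30)]),                       # initial denaturation
        (35, [(94, 15), (55, 15), (72, 90)]),  # 35 amplification cycles
        (1, [(72, 480)]),                      # final extension, 8 min
    ],
    "Vac10, Vac41": [
        (1, [(94, 30)]),
        (35, [(94, 15), (55, 15), (72, 150)]),
        (1, [(72, 480)]),
    ],
    # Initial denaturation omitted: its conditions are not given in the text.
    "Vac36, Vac37": [
        (2, [(94, 15), (30, 15), (72, 90)]),   # low-stringency annealing cycles
        (23, [(94, 15), (55, 15), (72, 90)]),
        (1, [(72, 360)]),                      # final extension, 6 min
    ],
}


def hold_seconds(program):
    """Sum the programmed hold times of a cycling program, in seconds."""
    return sum(cycles * sum(sec for _, sec in steps) for cycles, steps in program)


if __name__ == "__main__":
    for name, program in PROGRAMS.items():
        print(f"{name}: {hold_seconds(program) / 60:.1f} min of programmed holds")
```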

Upon completion of the thermal cycling reactions, each sample of amplified
DNA was subjected to electrophoresis on 1.0% agarose gels. The DNA was visualized
by exposure to ethidium bromide and long wave UV irradiation, and cut out in gel
slices. DNA was purified using the Wizard PCR Preps Kit (Promega Corp., Madison,
WI, USA), and then subjected to digestion with BamHI and EcoRI (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). The
digested PCR amplicon was then re-electrophoresed and purified as before.

Ligation of H. pylori DNA sequences into cloning vectors

The pOK12 vector (J. Vieira and J. Messing, Gene 100:189-194, 1991) was
prepared for cloning by digestion with BamHI and EcoRI in the case of Vac9, 10, 22, 31
and 32, whereas the pSU21 vector (B. Bartolome et al., Gene 102:75-78, 1991) was
prepared for cloning by digestion with BamHI and EcoRI in the case of Vac41 (Current
Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds.,
1994). The vectors were subjected to electrophoresis on 1.0% agarose gels and purified
using the Wizard PCR Preps kit (Promega Corp., Madison, WI, USA). Following
ligation of the purified, digested vector and the purified, digested amplified H. pylori
ORF, the products of the ligation reaction were transformed into E. coli JM109
competent cells according to standard methods (Current Protocols in Molecular Biology,
John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). Individual bacterial colonies
were screened for those containing the correct recombinant plasmids by incubating in
LB broth overnight (plus 25 µg/ml kanamycin sulfate for the pOK12-based plasmids or
25 µg/ml chloramphenicol for the pSU21-based plasmids), followed by plasmid DNA
preparation using the Magic Minipreps system (Promega Corp., Madison, WI, USA)
and analysis by restriction digestion (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

Cloning of H. pylori DNA sequences into the pET28b and pET30a prokaryotic
expression vectors

Both the pET28b and pET30a expression vectors were prepared for cloning by
digestion with NdeI and EcoRI (Current Protocols in Molecular Biology, John Wiley
and Sons, Inc., F. Ausubel et al., eds., 1994). The H. pylori DNA sequences were
removed from the pOK12 (Vac9, 10, 22, 31 and 32) or pSU21 (Vac41) plasmid
backbones by digestion with NdeI and EcoRI (Current Protocols in Molecular Biology,
John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). The pET28b, pET30a and
H. pylori DNA sequences were all electrophoresed on a 1% agarose gel and purified
using the Wizard PCR Preps kit (Promega Corp., Madison, WI, USA). Following
ligation of the purified, digested expression vector and the purified, digested H. pylori
DNA sequences, the products of the ligation reaction were transformed into E. coli
JM109 competent cells (Current Protocols in Molecular Biology, John Wiley and Sons,
Inc., F. Ausubel et al., eds., 1994). Individual bacterial colonies were screened for
those containing the correct recombinant plasmids by preparing plasmid DNA as
described above, followed by analysis of restriction digestion profiles and DNA
sequencing (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., eds., 1994). These recombinant plasmids were then used to transform
specific E. coli expression strains.

Transformation of competent bacteria with recombinant expression plasmids

Competent bacterial strains (BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and
HMS174(DE3)pLysS) were prepared and transformed with the recombinant pET28b
expression plasmids carrying the cloned H. pylori sequences according to standard
methods (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., eds., 1994). These expression host strains contain a chromosomal copy
of the gene for T7 RNA polymerase. These hosts are lysogens of bacteriophage DE3, a
lambda derivative that carries the lacI gene, the lacUV5 promoter and the gene for T7
RNA polymerase. T7 RNA polymerase expression is induced by the addition of
isopropyl-β-D-thiogalactoside (IPTG), and the T7 RNA polymerase then transcribes any
target plasmid, such as pET28b, that carries a T7 promoter sequence and a gene of
interest.

Expression of recombinant H. pylori sequences in E. coli

Transformants were collected from LB agar plates containing 25 µg/ml
kanamycin sulfate (which ensures maintenance of the pET28b-based recombinant
plasmids) and used to inoculate LB broth containing 25 µg/ml kanamycin sulfate. The
cultures were grown to an optical density at 600 nm of 0.5 to 1.0 OD units, at which
point 1 mM IPTG was added to the culture for one to three hours to induce gene
expression of the H. pylori recombinant DNA constructions. After induction of gene
expression with IPTG, bacteria were pelleted by centrifugation, resuspended in
SDS-PAGE solubilization buffer and subjected to SDS-PAGE (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1990). Proteins
were visualized by staining with Coomassie Brilliant Blue or detected by western
immunoblotting using the specific anti-His tag monoclonal antibody (Clontech, Palo
Alto, CA, USA) using standard methods (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). The host strain that provided the
highest level of recombinant protein production was then chosen for use in a
large-scale induction in order to purify the recombinant protein. All of the proteins
listed below were expressed recombinantly; the strain giving the highest level of
expression is listed for each:
BL21(DE3) (vac31, vac26, vac37); BL21(DE3)pLysS (vac9, 32); HMS174(DE3)
(vac10, 11).

Purification of recombinant proteins and generation of specific antiserum

Large scale cultures were inoculated and grown as above, and induced with 1 mM
IPTG for 3 hours. After induction, bacteria were pelleted by centrifugation in a Sorvall
centrifuge at 3500 x g for 15 min at 4°C. All of the expressed recombinant proteins
were present in the insoluble inclusion body fraction. Inclusion bodies were purified
according to standard protocols (Antibodies, Cold Spring Harbor Laboratory Press, E.
Harlow and D. Lane, eds., 1988). The recombinant protein produced by vac32 was
solubilized in 8M urea and partially purified by nickel chromatography (REF here).
Denatured recombinant proteins were purified by electrophoresis on SDS-PAGE gels,
and after visualization with Coomassie Brilliant Blue, the protein was excised from the
gel and the gel slices homogenized. This material was used to raise specific polyclonal
antibodies in mice or rabbits according to standard protocols (Antibodies, Cold Spring
Harbor Laboratory Press, E. Harlow and D. Lane, eds., 1988).

Immunological characterization of recombinant proteins

In every case in which raising antibody was attempted, high titre antiserum was
generated, confirming the immunogenicity of the recombinant proteins. Further, these
specific antisera were used to analyze whether the protein encoded by the cloned gene
was expressed in H. pylori. Western immunoblot analysis using standard protocols
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al.,
eds., 1994) confirmed that H. pylori strain J99 did express proteins of the expected
molecular weight that reacted with the vac10, vac32, vac31 and vac36 antisera. The
specific antiserum was also used to determine the level of antigenic conservation
between a large number of H. pylori isolates that had been obtained from distinct
geographical sites around the world, and from all types of clinical manifestations,
including gastritis, duodenal ulcer, gastric ulcer and gastric cancer. It was found that
every strain produced a protein that reacted specifically with each antiserum.

Further, H. pylori cells from strains J99, 17874, AH244 and SS1 were
fractionated into different cellular compartments (Doig and Trust 1994 Infect. Immun.
62:4526-4533; O'Toole et al. 1995 J. Bacteriol. 177:6049-6057). The specific antiserum
was used to probe these fractions by western immunoblot to identify the fraction in
which the protein was localized. In all cases, the immunoreactive protein was present in
the outer membrane, as had been predicted by the sequence features and motif searches
described herein.



Demonstration of protein efficacy as a vaccine

Purification of vac36 for efficacy studies

All the following steps were carried out at 4°C. Cell pellets were resuspended in
5 volumes per gram of cells of lysis buffer (50 mM sodium phosphate pH 8.0, 0.5 M
NaCl, 5 mM imidazole) with 10 mM EDTA, 1 mM phenylmethylsulfonyl fluoride
(PMSF) and 0.1% β-mercaptoethanol, and ruptured by several passages through a small
volume microfluidizer (Model M-110S, Microfluidics International Corporation,
Newton, MA). The resulting homogenate was made 0.2% sodium deoxycholate (DOC),
stirred 20 minutes, then centrifuged (10,000 g x 30 min). The pellets were washed twice
with lysis buffer containing 10 mM EDTA, 1% Triton X-100, 1 mM PMSF and 0.1%
β-mercaptoethanol, then with lysis buffer containing 1 M urea, 1 mM PMSF and 0.1%
β-mercaptoethanol. The resulting white pellet is composed primarily of inclusion bodies,
free of unbroken cells and membranous materials.

The inclusion bodies were dissolved in 20 ml of 6 M guanidine-HCl in lysis buffer
with 1 mM PMSF and 0.1% β-mercaptoethanol, and incubated on ice for 1 hour.
Materials that did not dissolve were removed by centrifugation (100,000 g x 30 min).
The clear supernatant was filtered through a 0.8 µm Supor filter (Gelman Sciences,
FRG) and then loaded directly onto a 10 ml Ni2+-NTA agarose column (Hochuli et al.
1987) pre-equilibrated in 6 M guanidine-HCl in lysis buffer containing 1 mM PMSF
and 0.1% β-mercaptoethanol. The column was washed with 20 ml (2 bed volumes) of
lysis buffer containing 6 M guanidine-HCl, 1 mM PMSF and 0.1% β-mercaptoethanol,
then guanidine-HCl was removed slowly with a 100 ml linear gradient (from 6 M to 0 M
guanidine-HCl) of lysis buffer containing 0.5% Brij 35, 1 mM PMSF and 0.1%
β-mercaptoethanol. Next, the column was developed with a 25 ml linear gradient of
increasing imidazole (5 to 500 mM) in lysis buffer containing 0.5% Brij 35, 1 mM
PMSF and 0.1% β-mercaptoethanol. The recombinant proteins elute as a peak centered
at 100 mM imidazole.
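
For orientation, the imidazole gradient described above can be modeled as an ideal linear gradient to estimate where in the 25 ml the 100 mM point falls. This is a sketch under simplifying assumptions (no dead volume, no mixing delay); the function names are illustrative and not from the original protocol.

```python
def gradient_concentration(volume_ml, start_mm=5.0, end_mm=500.0, total_ml=25.0):
    """Imidazole concentration (mM) after volume_ml of an ideal linear gradient."""
    if not 0.0 <= volume_ml <= total_ml:
        raise ValueError("volume lies outside the gradient")
    return start_mm + (end_mm - start_mm) * (volume_ml / total_ml)


def volume_at_concentration(conc_mm, start_mm=5.0, end_mm=500.0, total_ml=25.0):
    """Gradient volume (ml) at which the ideal gradient reaches conc_mm."""
    if not start_mm <= conc_mm <= end_mm:
        raise ValueError("concentration lies outside the gradient")
    return total_ml * (conc_mm - start_mm) / (end_mm - start_mm)


if __name__ == "__main__":
    # Position of the 100 mM point at which the recombinant proteins elute.
    print(f"{volume_at_concentration(100.0):.2f} ml into the gradient")
```

Under these assumptions the 100 mM point falls roughly 4.8 ml into the 25 ml gradient; the same functions apply to the 100 ml guanidine-HCl gradient if the start, end and total volume are changed accordingly.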

Fractions containing the recombinant proteins were pooled and then concentrated
to approximately 8 ml by centrifugal filtration (Centriprep-10, Amicon, MA), and
loaded directly onto a 350-ml column (2.2 x 91 cm) of Sephacryl S-100 HR gel filtration
medium equilibrated in Buffer A (50 mM sodium phosphate, pH 8.0, 500 mM NaCl,
0.1 mM EGTA, 1 mM PMSF, 0.1% β-mercaptoethanol, 0.5% Brij 35) and run in Buffer
A at 30 ml/h. Fractions containing the recombinant protein were identified by
absorbance at 280 nm and analyzed by SDS-PAGE. Fractions were pooled,
concentrated to 1.5 to 2 mg/ml and dialysed overnight against 10 mM potassium
phosphate pH 7.5, 150 mM NaCl, 0.1 mM EGTA and 0.5% Brij 35. The protein
concentration of the dialysate was quantified, and the dialysate was aliquoted prior to
freezing at -20°C.

Mouse model of Helicobacter pylori infection

A murine model of H. pylori infection was produced by infection of C57BL/6
mice with H. pylori Sydney strain SS1 and was used to assess the efficacy of
recombinant H. pylori vac36. This mouse-adapted H. pylori strain is cagA+ vacA+,
shows colonization levels in C57BL/6 mice equivalent to those observed in humans,
forms adhesion pedestals, colonizes for at least 8 months, and elicits a chronic-active
gastritis and mucosal atrophy (Lee et al., Gastroenterology, 112:1386-1397, 1997).
Dose-response studies have shown 100% infection rates of inbred C57BL/6 and Balb/C
mice at 8 weeks post-challenge with a single inoculation of 10^ organisms.

Assessment of gastric H. pylori infection

The presence of H. pylori organisms in gastric tissue was determined by culture
of gastric tissue and by a quantitative urease assay. In the latter method, a longitudinal
segment of antrum, representing approximately Va of the total antral region was placed in
1 ml of urea broth. After 4 hr, the extent of color change resulting from urea hydrolysis
and increased pH was quantitated by spectrophotometric measurement of A550 (Fox et
al., Immunol. 88:400-406, 1996). The assay sensitivity is approximately 10^3 H. pylori
organisms. A positive (H. pylori-infected) gastric tissue was defined as a sample
showing an A550 value 2 standard deviations above the mean A550 value derived from
a group of unchallenged uninfected age-matched control mice.
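
The positivity criterion above amounts to a simple cutoff calculation, sketched below. The control readings are hypothetical values invented for the example, and the use of the sample standard deviation (n - 1 denominator) and of a strict "greater than" comparison are assumptions, since the text does not specify either.

```python
from statistics import mean, stdev


def positive_cutoff(control_a550, n_sd=2.0):
    """A550 cutoff: mean of uninfected controls plus n_sd sample standard deviations."""
    return mean(control_a550) + n_sd * stdev(control_a550)


def is_positive(sample_a550, cutoff):
    """Classify a gastric sample as H. pylori-positive if its A550 exceeds the cutoff."""
    return sample_a550 > cutoff


if __name__ == "__main__":
    # Hypothetical A550 readings from unchallenged, age-matched control mice.
    controls = [0.04, 0.05, 0.06, 0.05, 0.05]
    cutoff = positive_cutoff(controls)
    for reading in (0.06, 0.50):  # hypothetical test samples
        print(reading, "positive" if is_positive(reading, cutoff) else "negative")
```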






Assessment of local immune response to immunization in gastric tissue

Longitudinal sections of gastric tissues from the esophageal to the duodenal
junction were embedded in OCT embedding compound and frozen in liquid nitrogen,
and cryosections were immunostained with monoclonal antibodies recognizing CD4+
or CD8+ T cells or with antisera against mouse IgA for identification of IgA containing
(IgACC) plasma cells (Pappo et al., Infect. Immun. 63:1246-1252, 1995). The degree of
local gastric immune response was expressed quantitatively as the number of CD4+,
CD8+ or IgACC cells per mm2 of gastric region examined.

Protective activity of purified recombinant H. pylori vac36 antigen

The ability of purified recombinant vac36 antigen derived from H. pylori to
interfere with the establishment of an H. pylori infection was examined in mice. Groups
(n=10) of 6-8 week-old female C57BL/6 mice were immunized orally 4 times at weekly
intervals as follows: 1) 100 µg of recombinant vac36 antigen and 10 µg cholera toxin
(CT) adjuvant, 2) 1 mg H. pylori lysate antigens and 10 µg CT, and 3) 0.2 M
bicarbonate buffer and 10 µg CT adjuvant. The mice were challenged 2 weeks later on 3
consecutive days by oral administration of 10^ H. pylori organisms. The experiment
was terminated 2 weeks post-challenge, and the H. pylori infection level was assessed
by bacterial colony counts and by quantitative urease assays.

Oral immunization with vac36 antigen interfered with the establishment of H.
pylori infection upon challenge with live H. pylori organisms. Mice immunized with
purified recombinant vac36 antigen exhibited a significantly lower level of colonization
by H. pylori, as assessed by gastric urease activity and bacterial count assays (Table 6).
Oral immunization with vac36 antigen also resulted in the generation of a local
protective gastric immune response. Greater numbers of CD4+ T cells and of IgACC
were recruited in the gastric tissues of vac36-immunized mice when compared with
unimmunized H. pylori-infected mice (Table 7).
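
To put the group means reported in Table 6 on a common scale, the reduction relative to the buffer (CT adjuvant alone) group can be computed as a percentage. The sketch below uses only the means transcribed from Table 6 and ignores the SEMs; the names TABLE_6_MEANS, percent_reduction and reductions are illustrative assumptions.

```python
# Group means transcribed from Table 6 (SEMs are not used in this sketch).
TABLE_6_MEANS = {
    "vac36": {"urease_a550": 0.199, "cfu": 55_800},
    "H. pylori lysate": {"urease_a550": 0.057, "cfu": 2_360},
    "buffer": {"urease_a550": 1.655, "cfu": 131_000},
}


def percent_reduction(treated, control):
    """Percent decrease of a treated-group mean relative to the control mean."""
    return 100.0 * (control - treated) / control


def reductions(group, control_group="buffer"):
    """Percent reduction of each Table 6 readout for one group, to one decimal."""
    control = TABLE_6_MEANS[control_group]
    treated = TABLE_6_MEANS[group]
    return {key: round(percent_reduction(treated[key], control[key]), 1)
            for key in treated}


if __name__ == "__main__":
    for group in ("vac36", "H. pylori lysate"):
        print(group, reductions(group))
```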

Table 6

Recombinant vac36 antigen protects mice from challenge with H. pylori

Vaccine Treatment    Urease                      H. pylori
Group                Activity a       p b        burden c            p b

vac36                0.199±0.080      0.0022     55,800±12,599       0.0125
H. pylori lysate     0.057±0.007      0.0002     2,360±955           0.0002
buffer               1.655±0.420                 131,000±18,391

a Urease activity is expressed as mean A550 ± SEM of duplicate antral samples
  from n=10 mice/group.
b by Wilcoxon Rank Sum Test compared with mice immunized with CT adjuvant
  alone
c The level of H. pylori in gastric tissue was assessed by bacterial counts, and
  shown as mean colony forming units ± SEM

Table 7

vac36-immunized mice generate a local gastric immune response upon challenge with
H. pylori

Vaccine             CD4+                        CD8+                        IgACC
Treatment Group     cardia   corpus   antrum    cardia   corpus   antrum    cardia   corpus   antrum

vac36               33±9 a   54±8*    31±8      3±2      0        1±1       24±12    79±16    67±13
H. pylori lysate    31±13    36±19    24±8      4±2      2±1      2±1       31±9     73±13*   79±15
buffer              12±2     27±8     18±4      1±1      0        0         4±2      30±13    46±14

a Mean number of cells/mm2 of gastric region ± SEM
* p<0.05 by Wilcoxon Rank Sum Test when compared with unimmunized H.
  pylori infected mice



V. Sequence Variance Analysis of genes in Helicobacter pylori strains

Four genes were cloned and sequenced from several strains of H. pylori to
compare the DNA and deduced amino acid sequences. This information was used to
determine the sequence variation between the H. pylori strain, J99, and other H. pylori
strains isolated from human patients.

Preparation of Chromosomal DNA.

Cultures of H. pylori strains (as listed in Table 10) were grown in BLBB (1%
Tryptone, 1% Peptamin, 0.1% Glucose, 0.2% Yeast Extract, 0.5% Sodium Chloride, 5%
Fetal Bovine Serum) to an OD600 of 0.2. Cells were centrifuged in a Sorvall RC-3B at
3500 x g at 4°C for 15 minutes and the pellet resuspended in 0.95 ml of 10 mM
Tris-HCl, 0.1 mM EDTA (TE). Lysozyme was added to a final concentration of 1 mg/ml
along with SDS to 1% and RNAse A + T1 to 0.5 mg/ml and 5 units/ml respectively, and
the sample was incubated at 37°C for one hour. Proteinase K was then added to a final
concentration of 0.4 mg/ml and the sample was incubated at 55°C for more than one
hour. NaCl was added to the sample to a concentration of 0.65 M, mixed carefully, and
0.15 ml of 10% CTAB in 0.7 M NaCl (final is 1% CTAB/70 mM NaCl) was added,
followed by incubation at 65°C for 20 minutes. At this point, the samples were extracted
with chloroform:isoamyl alcohol, extracted with phenol, and extracted again with
chloroform:isoamyl alcohol. DNA was precipitated with either EtOH (1.5 x volumes) or
isopropanol (0.6 x volumes) at -70°C for 10 minutes, washed in 70% EtOH and
resuspended in TE.

PCR Amplification and cloning.

Genomic DNA prepared from twelve strains of Helicobacter pylori was used as
the source of template DNA for PCR amplification reactions (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To
amplify a DNA sequence containing an H. pylori ORF, genomic DNA (10 nanograms)
was introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic
oligonucleotide primers (forward and reverse primers, see Table 8) complementary to
and flanking a defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate
(dATP, dGTP, dCTP, dTTP) and 0.5 units of heat stable DNA polymerase (Amplitaq,
Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 20
microliters in duplicate reactions.

Table 8

Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences.

Outer membrane proteins / Forward primer 5' to 3' / Reverse primer 5' to 3'

Protein 26054702 (for strains AH4, AH15, AH61, 5294, 5640, AH18, and AH244)
  Forward: 5'-TTAACCATGGTGAAAAGCGATA-3' (SEQ ID NO:217)
  Reverse: 5'-TAGAATTCGCCTCTAAAACTTTAG-3' (SEQ ID NO:218)

Protein 26054702 (for strains AH5, 5155, 7958, AH24, and J99)
  Forward: 5'-TTAACCATGGTGAAAAGCGATA-3' (SEQ ID NO:219)
  Reverse: 5'-TAGAATTCGCATAACGATCAATC-3' (SEQ ID NO:220)

Protein 7116626
  Forward: 5'-ATATCCATGGTGAGTTTGATGA-3' (SEQ ID NO:221)
  Reverse: 5'-ATGAATTCAATTTTTTA'rnTGCCA-3' (SEQ ID NO:222)

Protein 29479681
  Forward: 5'-AATTCCATGGCTATCCAAATCCG-3' (SEQ ID NO:223)
  Reverse: 5'-ATGAATTCGCCAAAATCGTAGTATT-3' (SEQ ID NO:224)

Protein 346
  Forward: 5'-GATACCATGGAATTTATGAAAAAG-3' (SEQ ID NO:225)
  Reverse: 5'-TGAATTCGAAAAAGTGTAGTTATAC-3' (SEQ ID NO:226)

The following thermal cycling conditions were used to obtain amplified DNA
products for each ORF using a Perkin Elmer Cetus/GeneAmp PCR System 9600
thermal cycler:

Protein 7116626 and Protein 346:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Protein 26054702 for strains AH5, 5155, 7958, AH24, and J99:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
25 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reaction was concluded at 72°C for 6 minutes.

Protein 26054702 and Protein 29479681 for strains AH4, AH15, AH61, 5294, 5640,
AH18, and Hp244:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 20 sec and 72°C for 2 min
25 cycles at 94°C for 15 sec, 55°C for 20 sec and 72°C for 2 min
Reactions were concluded at 72°C for 8 minutes.



Upon completion of thermal cycling reactions, each pair of samples was
combined and used directly for cloning into the pCR cloning vector as described below.

Cloning of H. pylori DNA sequences into the pCR TA cloning vector.

All amplified inserts were cloned into the pCR 2.1 vector by the method
described in the Original TA cloning kit (Invitrogen, San Diego, CA). Products of the
ligation reaction were then used to transform the TOP10F' (INVαF' in the case of H.
pylori sequence 350) strain of E. coli as described below.



Transformation of competent bacteria with recombinant plasmids

Competent bacteria, E. coli strain TOP10F' or E. coli strain INVαF', were
transformed with recombinant pCR expression plasmids carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of 0.5
micromolar BME was added to each vial of 50 microliters of competent cells.
Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and
incubated on ice for 30 minutes. The cells and ligation mixture were then subjected to a
"heat shock" at 42°C for 30 seconds, and were subsequently placed on ice for an
additional 2 minutes, after which samples were incubated in 0.45 milliliters SOC
medium (0.5% yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM
MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37°C with shaking for 1 hour. Samples
were then spread on LB agar plates containing 25 micrograms/ml kanamycin sulfate or
100 micrograms/ml ampicillin for growth overnight. Transformed colonies of TOP10F'
or INVαF' were then picked and analyzed to evaluate cloned inserts as described below.

Identification of recombinant pCR plasmids carrying H. pylori sequences

Individual TOP10F' or INVαF' clones transformed with recombinant pCR-
H. pylori ORFs were analyzed by PCR amplification of the cloned inserts using the same
forward and reverse primers, specific for each H. pylori sequence, that were used in the
original PCR amplification cloning reactions. Successful amplification verified the
integration of the H. pylori sequences in the cloning vector (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994).

Individual clones of recombinant pCR vectors carrying properly cloned H. pylori
ORFs were picked for sequence analysis. Sequence analysis was performed on ABI
Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as
found in pCRII or pCR2.1, Invitrogen, San Diego, CA) and sequencing primers specific
to the ORF as listed in Table 9 below.

Table 9

Oligonucleotide primers used for sequencing of H. pylori DNA sequences.

Outer membrane     Forward primers 5' to 3'                            Reverse primers 5' to 3'
proteins

Protein 26054702   5'-CCCTTCATTTTAGAAATCG-3' (SEQ ID NO:227)           5'-CTTTGGGTAAAAACGCATC-3' (SEQ ID NO:234)
                   5'-ATTTCAACCAATTCAATGCG-3' (SEQ ID NO:228)          5'-CGATCTTTGATCCTAATTCA-3' (SEQ ID NO:235)
                   5'-GCCCCTTTTGATTTGAAGCT-3' (SEQ ID NO:229)          5'-ATCAAGTTGCCTATGCTGA-3' (SEQ ID NO:236)
                   5'-TCGCTCCAAGATACCAAGAAGT-3' (SEQ ID NO:230)
                   5'-CTTGAATTAGGGGCAAAGATCG-3' (SEQ ID NO:231)
                   5'-ATGCGTTTTTACCCAAAGAAGT-3' (SEQ ID NO:232)
                   5'-ATAACGCCACTTCCTTATTGGT-3' (SEQ ID NO:233)

Protein 7116626    5'-TTGAACACTTTTGATTATGCGG-3' (SEQ ID NO:237)        5'-GTCTTTAGCAAAAATGGCGTC-3' (SEQ ID NO:239)
                   5'-GGATTATGCGATTGTTTTACAAG-3' (SEQ ID NO:238)       5'-AATGAGCGTAAGAGAGCCTTC-3' (SEQ ID NO:240)

Protein 29479681   jtl 1 A I OUVjUO 1 A 1 1 0 1 C A-3' (SEQ ID NO:241)  j AUU llUll vjV^L, 1 AA AVj Al^ 1 -3' (SEQ ID NO:243)
                   5'-AGCATGTGGGTATCCAGC-3' (SEQ ID NO:242)            5'-CTGCCTCCACCTTTGATC-3' (SEQ ID NO:244)

Protein 346        5'-ACCAATATCAATTGGCACT-3' (SEQ ID NO:245)           5'-CTTGCTTGTCATATCTAGC-3' (SEQ ID NO:247)
                   5'-ACTTGGAAAAGCTCTGCA-3' (SEQ ID NO:246)            5'-GTTGAAGTGTTGGTGCTA-3' (SEQ ID NO:248)

                   5'-CAAGCAAGTGGTTTGGTTTTAG-3' (SEQ ID NO:249)        5'-GCCCATAATCAAAAAGCCCAT-3' (SEQ ID NO:251)
                   5'-TGGAAAGAGCAAATCATTGAAG-3' (SEQ ID NO:250)        5'-CTAAAACCAAACCACTTGCTTGTC-3' (SEQ ID NO:252)

Vector Primers     5'-GTAAAACGACGGCCAG-3' (SEQ ID NO:253)              5'-CAGGAAACAGCTATGAC-3' (SEQ ID NO:254)

Results

To establish the PCR error rate in these experiments, five individual clones of
Protein 26054702, prepared from five separate PCR reaction mixtures from H. pylori
strain J99, were sequenced over a total length of 897 nucleotides each, for a cumulative
total of 4485 bases of DNA sequence. DNA sequence for the five clones was compared
to a DNA sequence obtained previously by a different method, i.e., random shotgun
cloning and sequencing. The PCR error rate for the experiments described herein was
determined to be 2 base changes out of 4485 bases, which is equivalent to an estimated
error rate of approximately 0.04%.
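
The error-rate arithmetic above can be reproduced in a few lines. This is only a worked check of the figures stated in the text (five clones, 897 nucleotides each, 2 base changes); the function names are illustrative.

```python
def total_bases(clones, bases_per_clone):
    """Cumulative number of bases sequenced across all clones."""
    return clones * bases_per_clone


def error_rate_percent(errors, bases):
    """Observed base changes expressed as a percentage of bases sequenced."""
    return 100.0 * errors / bases


if __name__ == "__main__":
    bases = total_bases(clones=5, bases_per_clone=897)
    rate = error_rate_percent(errors=2, bases=bases)
    print(f"{bases} bases sequenced, error rate {rate:.4f}%")
```

With the stated figures this gives 4485 bases and a rate of about 0.045%, that is, roughly one base change per 2,240 bases sequenced.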

DNA sequence analysis was performed on four different open reading frames
identified as genes and amplified by PCR methods from a dozen different strains of the
bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four
open reading frames that were selected for this study showed statistically significant
BLAST homology to defined proteins present in other bacterial species. Those ORFs
included: Protein 26054702, homologous to the val A & B genes encoding an ABC
transporter in F. novicida; Protein 7116626, homologous to lipoprotein e (P4) present in
the outer membrane of H. influenzae; and Protein 29479681, homologous to fecA, an
outer membrane receptor in iron (III) dicitrate transport in E. coli. Protein 346 was
identified as an unknown open reading frame, because it showed low homology with
sequences in the public databases.

To assess the extent of conservation or variance in the ORFs across various 
strains of H. pylori, changes in DNA sequence and the deduced protein sequence were 
compared to the DNA and deduced protein sequences found in the J99 strain of K 
pylori {see Table 10 below). Results are presented as percent identity to the J99 strain of 

25 H. pylori sequenced by random shotgun cloning. To control for any variations in the 
J99 sequence each of the four open reading frames were cloned and sequenced again 
from the J99 bacterial strain and that sequence information was compared to the 
sequence information that had been collected from inserts cloned by random shotgun 
sequencing of the J99 strain. The data demonstrate that there is variation in the DNA 

30 sequence ranging from as little as 0. 12 % difference (Protein 346, J99 strain) to 
approximately 7% change (Protein 26054702, strain AH5). The deduced protein 
sequences show either no variation ( Protein 346, strains AH 18 and AH24) or up to as 
much as 7.66% amino acid changes (Protein 26054702, Strain AH5). 
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Table 10 

Multiple Strain DNA Sequence Analysis of H. pylori Vaccine Candidates

J99 Protein #:   26054702            7116626             29479681            346
Length of Region
Sequenced:       248 a.a.  746 nt.   232 a.a.  96 nt.    182 a.a.  548 nt.   273 a.a.  819 nt.

Strain Tested    AA        Nuc.      AA        Nuc.      AA        Nuc.      AA        Nuc.
                 identity  identity  identity  identity  identity  identity  identity  identity

J99              100.00%   100.00%   100.00%   100.00%   100.00%   100.00%   99.63%    99.88%
AH244            95.16%    95.04%    n.d.      n.d.      99.09%    96.71%    98.90%    96.45%
AH4              95.97%    95.98%    97.84%    95.83%    n.d.      n.d.      97.80%    95.73%
AH5              92.34%    93.03%    98.28%    96.12%    98.91%    96.90%    98.53%    95.73%
AH15             95.16%    94.91%    97.41%    95.98%    99.82%    97.99%    99.63%    96.09%
AH61             n.d.      n.d.      97.84%    95.98%    99.27%    97.44%    n.d.      n.d.
5155             n.d.      n.d.      n.d.      n.d.      99.45%    97.08%    98.53%    95.60%
5294             94.35%    94.37%    98.28%    95.40%    99.64%    97.26%    97.07%    95.48%
7958             94.35%    94.10%    97.84%    95.40%    n.d.      n.d.      99.63%    96.46%
5640             95.16%    94.37%    97.41%    95.69%    99.09%    97.63%    98.53%    95.48%
AH18             n.d.      n.d.      98.71%    95.69%    99.64%    97.44%    100.00%   95.97%
AH24             94.75%    95.04%    97.84%    95.40%    99.27%    96.71%    100.00%   96.46%

n.d. = not done.
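
The identity values of Table 10 and the percent differences quoted in the text are related by simple arithmetic. The following is a minimal sketch only, assuming two sequences that have already been aligned to equal length without gaps; the function names and the toy sequences are hypothetical and are not part of the described method.

```python
def percent_identity(reference: str, query: str) -> float:
    """Percent of aligned positions at which the query matches the reference.

    Assumption: both sequences are pre-aligned, gap-free and of equal length.
    """
    if not reference or len(reference) != len(query):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(reference.upper(), query.upper()) if a == b)
    return 100.0 * matches / len(reference)


def percent_difference(identity_percent: float) -> float:
    """Percent difference corresponding to a percent identity value."""
    return round(100.0 - identity_percent, 2)


# Toy example: one mismatch in four aligned positions.
toy_identity = percent_identity("ATGC", "ATGA")

# Values taken from Table 10: Protein 26054702, strain AH5 (amino acid identity),
# and Protein 346, J99 strain (nucleotide identity).
ah5_aa_difference = percent_difference(92.34)
j99_nt_difference = percent_difference(99.88)
```

With the Table 10 entries above, the computed differences correspond to the 7.66% and 0.12% figures cited in the text.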

VI. Experimental Knock-Out Protocol for the Determination of Essential H. pylori
Genes as Potential Therapeutic Targets

Therapeutic targets are chosen from genes whose protein products appear to play 
key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis, 
transcription, translation, regulation and colonization/virulence. 

The protocol for the deletion of portions of H. pylori genes/ORFs and the
insertional mutagenesis of a kanamycin-resistance cassette in order to identify genes
which are essential to the cell is modified from previously published methods
(Labigne-Roussel et al., 1988, J. Bacteriology 170, pp. 1704-1708; Cover et al., 1994,
J. Biological Chemistry 269, pp. 10566-10573; Reyrat et al., 1995, Proc. Natl. Acad.
Sci. 92, pp. 8768-8772). The result is a gene "knock-out."

Identification and Cloning of H. pylori Gene Sequences

The sequences of the genes or ORFs (open reading frames) selected as knock-out
targets are identified from the H. pylori genomic sequence and used to design primers to
specifically amplify the genes/ORFs. All synthetic oligonucleotide primers are designed
with the aid of the OLIGO program (National Biosciences, Inc., Plymouth, MN 55447,
USA), and can be purchased from Gibco/BRL Life Technologies (Gaithersburg, MD,
USA). If the ORF is smaller than 800 to 1000 base pairs, flanking primers are chosen
outside of the open reading frame.

Genomic DNA prepared from the Helicobacter pylori HpJ99 strain (ATCC
55679; deposited by Genome Therapeutics Corporation, 100 Beaver Street, Waltham,
MA 02154) is used as the source of template DNA for amplification of the ORFs by
PCR (polymerase chain reaction) (Current Protocols in Molecular Biology, John Wiley
and Sons, Inc., F. Ausubel et al., editors, 1994). For the preparation of genomic DNA
from H. pylori, see Example I. PCR amplification is carried out by introducing 10
nanograms of genomic HpJ99 DNA into a reaction vial containing 10 mM Tris pH 8.3,
50 mM KCl, 2 mM MgCl2, 2 microMolar synthetic oligonucleotide primers
(forward=F1 and reverse=R1), 0.2 mM of each deoxynucleotide triphosphate
(dATP, dGTP, dCTP, dTTP), and 1.25 units of heat stable DNA polymerase (Amplitaq,
Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 40
microliters. The PCR is carried out with Perkin Elmer Cetus/GeneAmp PCR System
9600 thermal cyclers.

Upon completion of thermal cycling reactions, each sample of amplified DNA is
visualized on a 2% TAE agarose gel stained with ethidium bromide (Current Protocols
in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) to
determine that a single product of the expected size has resulted from the reaction.
Amplified DNA is then washed and purified using the Qiaquick Spin PCR purification
kit (Qiagen, Gaithersburg, MD, USA).

PCR products are cloned into the pT7Blue T-Vector (catalog #69820-1, Novagen,
Inc., Madison, WI, USA) using the TA cloning strategy (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The ligation of the
PCR product into the vector is accomplished by mixing a 6 fold molar excess of the
PCR product, 10 ng of pT7Blue-T vector (Novagen), 1 microliter of T4 DNA Ligase
Buffer (New England Biolabs, Beverly, MA, USA), and 200 units of T4 DNA Ligase
(New England Biolabs) into a final reaction volume of 10 microliters. Ligation is
allowed to proceed for 16 hours at 16°C.
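
The mass of PCR product corresponding to a 6 fold molar excess over 10 ng of vector depends on the relative lengths of insert and vector. The sketch below illustrates that arithmetic only; the vector length of 2887 base pairs and the 1000 base pair insert are assumptions chosen for illustration and should be replaced by the actual lengths.

```python
def insert_mass_ng(vector_ng: float, vector_bp: int, insert_bp: int,
                   molar_ratio: float) -> float:
    """Mass of insert (ng) giving the requested insert:vector molar ratio.

    For double-stranded DNA the mass per mole is proportional to length,
    so the molar ratio scales with (insert_bp / vector_bp).
    """
    if vector_bp <= 0 or insert_bp <= 0:
        raise ValueError("lengths must be positive")
    return molar_ratio * vector_ng * insert_bp / vector_bp


# Assumed values: 2887 bp vector and 1000 bp PCR product (both hypothetical here);
# 10 ng of vector and a 6 fold molar excess are taken from the text.
needed_ng = insert_mass_ng(vector_ng=10.0, vector_bp=2887, insert_bp=1000,
                           molar_ratio=6.0)
print(f"PCR product required: {needed_ng:.1f} ng")
```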

Ligation products are electroporated (Current Protocols in Molecular Biology,
John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into
electroporation-competent XL-1 Blue or DH5-α E. coli cells (Clontech Lab., Inc. Palo Alto, CA, USA).
Briefly, 1 microliter of ligation reaction is mixed with 40 microliters of
electrocompetent cells and subjected to a high voltage pulse (25 microFarads, 2.5 kV,
200 ohms) after which the samples are incubated in 0.45 ml SOC medium (0.5% yeast
extract, 2% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20
mM glucose) at 37°C with shaking for 1 hour. Samples are then spread onto LB (10 g/l
bacto tryptone, 5 g/l bacto yeast extract, 10 g/l sodium chloride) plates containing 100
microgram/ml of Ampicillin, 0.3% X-gal, and 100 microgram/ml IPTG. These plates
are incubated overnight at 37°C. Ampicillin-resistant colonies with white color are
selected, grown in 5 ml of liquid LB containing 100 microgram/ml of Ampicillin, and
plasmid DNA is isolated using the Qiagen miniprep protocol (Qiagen, Gaithersburg,
MD, USA).

To verify that the correct H. pylori DNA inserts have been cloned, these pT7Blue
plasmid DNAs are used as templates for PCR amplification of the cloned inserts, using
the same forward and reverse primers used for the initial amplification of the J99
H. pylori sequence. Recognition of the primers and a PCR product of the correct size as
visualized on a 2% TAE, ethidium bromide stained agarose gel are confirmation that the
correct inserts have been cloned. Two to six such verified clones are obtained for each
knock-out target, and frozen at -70°C for storage. To minimize errors due to PCR,
plasmid DNA from these verified clones is pooled, and used in subsequent cloning
steps.

The sequences of the genes/ORFs are again used to design a second pair of
primers which flank the region of H. pylori DNA to be either interrupted or deleted (up
to 250 basepairs) within the ORFs but are oriented away from each other. The pool of
circular plasmid DNAs of the previously isolated clones is used as template for this
round of PCR. Since the orientation of amplification of this pair of deletion primers is
away from each other, the portion of the ORF between the primers is not included in the
resultant PCR product. The PCR product is a linear piece of DNA with H. pylori DNA
at each end and the pT7Blue vector backbone between them which, in essence, results
in the deletion of a portion of the ORFs. The PCR product is visualized on a 1% TAE,
ethidium bromide stained agarose gel to confirm that only a single product of the correct
size has been amplified.
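
The geometry of this outward-facing ("inverse") amplification can be pictured with a toy model. The sketch below is illustrative only: it treats the circular plasmid as a string, ignores primer sequences and strand orientation, and uses hypothetical coordinates and made-up sequence.

```python
def inverse_pcr_product(plasmid: str, del_start: int, del_end: int) -> str:
    """Linear product obtained when outward-facing primers flank plasmid[del_start:del_end].

    The plasmid string represents a circular molecule, so the product runs from
    the end of the deleted region, around the circle, to the start of that region.
    """
    if not 0 <= del_start < del_end <= len(plasmid):
        raise ValueError("deleted region must lie within the plasmid")
    return plasmid[del_end:] + plasmid[:del_start]


# Toy circular plasmid: lowercase 'v' is vector backbone, uppercase is cloned ORF.
# 'XXX' marks the portion of the ORF lying between the two deletion primers.
toy_plasmid = "vvvv" + "AAAA" + "XXX" + "CCCC" + "vvv"
product = inverse_pcr_product(toy_plasmid, del_start=8, del_end=11)
print(product)
```

In this toy case the product carries ORF sequence at both ends with the vector backbone between them, and the segment between the primers is absent, as described above.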
A Kanamycin-resistance cassette (Labigne-Roussel et al., 1988, J. Bacteriology
170, 1704-1708) is ligated to this PCR product by the TA cloning method used
previously (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., editors, 1994). The Kanamycin cassette containing a Campylobacter
kanamycin resistance gene is obtained by carrying out an EcoRI digestion of the
recombinant plasmid pCTB&.kan (Cover et al., 1994, J. Biological Chemistry 269, pp.
10566-10573). The proper fragment (1.4 kb) is resolved on a 1% TAE gel and isolated
using the QIAquick gel extraction kit (Qiagen, Gaithersburg, MD, USA). The fragment
is end repaired using the Klenow fill-in protocol, which involves mixing 4 micrograms of
the DNA fragment, 1 microliter of dATP, dGTP, dCTP, dTTP at 0.5 mM, 2 microliters of
Klenow Buffer (New England Biolabs) and 5 units of Klenow DNA Polymerase I Large
(Klenow) Fragment (New England Biolabs) into a 20 microliter reaction, incubating at
30°C for 15 min, and inactivating the enzyme by heating to 75°C for 10 minutes. This
blunt-ended Kanamycin cassette is then purified through a Qiaquick column (Qiagen,
Gaithersburg, MD, USA) to eliminate nucleotides. The "T" overhang is then generated
by mixing 5 micrograms of the blunt-ended kanamycin cassette, 10 mM Tris pH 8.3, 50
mM KCl, 2 mM MgCl2, 5 units of DNA Polymerase (Amplitaq, Roche Molecular
Systems, Inc., Branchburg, NJ, USA), 20 microliters of 5 mM dTTP, in a 100 microliter
reaction and incubating the reaction for 2 hours at 37°C. The "Kan-T" cassette is
purified using a QIAquick column (Qiagen, Gaithersburg, MD, USA). The PCR
product of the deletion primers (F2 and R2) is ligated to the Kan-T cassette by mixing
10 to 25 ng of deletion primer PCR product, 50 to 75 ng Kan-T cassette DNA, 1
microliter 10x T4 DNA Ligase reaction mixture, 0.5 microliter T4 DNA Ligase (New
England Biolabs, Beverly, MA, USA) in a 10 microliter reaction and incubating for 16
hours at 16°C.

The ligation products are transformed into XL-1 Blue or DH5-α E. coli cells by
electroporation as described previously. After recovery in SOC, cells are plated onto LB
plates containing 100 microgram/ml Ampicillin and grown overnight at 37°C. These
plates are then replica plated onto plates containing 25 microgram/ml Kanamycin and
allowed to grow overnight. Resultant colonies have both the Ampicillin resistance gene
present in the pT7Blue vector, and the newly introduced Kanamycin resistance gene.
Colonies are picked into LB containing 25 microgram/ml Kanamycin and plasmid DNA
is isolated from the cultured cells using the Qiagen miniprep protocol (Qiagen,
Gaithersburg, MD, USA).

Several tests by PCR amplification are conducted on these plasmids to verify that
the Kanamycin cassette is inserted in the H. pylori gene/ORF, and to determine the orientation of
the insertion of the Kanamycin-resistance gene relative to the H. pylori gene/ORF. To
verify that the Kanamycin cassette is inserted into the H. pylori sequence, the plasmid
DNAs are used as templates for PCR amplification with the set of primers originally
used to clone the H. pylori gene/ORFs. The correct PCR product is the size of the
deleted gene/ORF but increased in size by the addition of a 1.4 kilobase Kanamycin
cassette. To avoid potential polar effects of the kanamycin resistance cassette on H.
pylori gene expression, the orientation of the Kanamycin resistance gene with respect to
the knock-out gene/ORF is determined and both orientations are eventually used in H.
pylori transformations (see below). To determine the orientation of insertion of the
kanamycin resistance gene, primers are designed from the ends of the kanamycin
resistance gene ("Kan-1", 5'-ATCTTACCTATCACCTCAAAT-3' (SEQ ID NO:255),
and "Kan-2", 5'-AGACAGCAACATCTTTGTGAA-3' (SEQ ID NO:256)). By using
each of the cloning primers in conjunction with each of the Kan primers (4 combinations
of primers), the orientation of the Kanamycin cassette relative to the H. pylori sequence
is determined. Positive clones are classified as either in the "A" orientation (the same
direction of transcription is present for both the H. pylori gene and the Kanamycin
resistance gene), or in the "B" orientation (the direction of transcription for the H. pylori
gene is opposite to that of the Kanamycin resistance gene). Clones which share the
same orientation (A or B) are pooled for subsequent experiments and independently
transformed into H. pylori.


Transformation of Plasmid DNA into H. pylori cells 

Two strains of H. pylori are used for transformation: ATCC 55679, the clinical
isolate which provided the DNA from which the H. pylori sequence database is
obtained, and AH244, an isolate which had been passaged in, and has the ability to
colonize the mouse stomach. Cells for transformation are grown at 37°C, 10% CO2,
100% humidity, either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells are
grown to exponential phase, and examined microscopically to determine that the cells
are "healthy" (actively moving cells) and not contaminated. If grown on plates, cells are
harvested by scraping cells from the plate with a sterile loop, suspended in 1 ml of
Brucella Broth, spun down (1 minute, top speed in eppendorf microfuge) and
resuspended in 200 microliters Brucella Broth. If grown in Brucella Broth liquid, cells
are centrifuged (15 minutes at 3000 rpm in a Beckman TJ6 centrifuge) and the cell pellet
resuspended in 200 microliters of Brucella broth. An aliquot of cells is taken to
determine the optical density at 600 nm, in order to calculate the concentration of cells.
An aliquot (1 to 5 OD600 units/25 microliter) of the resuspended cells is placed onto a
prewarmed Sheep-Blood agar plate, and the plate is further incubated at 37°C, 6% CO2,
100% humidity for 4 hours. After this incubation, 10 microliters of plasmid DNA (100
micrograms per microliter) is spotted onto these cells. A positive control (plasmid DNA
with the ribonuclease H gene disrupted by kanamycin resistance gene) and a negative
control (no plasmid DNA) are done in parallel. The plates are returned to 37°C, 6% CO2
for an additional 4 hours of incubation. Cells are then spread onto that plate using a
swab wetted in Brucella broth, and grown for 20 hours at 37°C, 6% CO2. Cells are then
transferred to a Sheep-Blood agar plate containing 25 micrograms/ml Kanamycin, and
allowed to grow for 3 to 5 days at 37°C, 6% CO2, 100% humidity. If colonies appear,
they are picked and regrown as patches on a fresh Sheep-Blood agar plate containing 25
micrograms/ml Kanamycin.

Three sets of PCR tests are done to verify that the colonies of transformants have
arisen from homologous recombination at the proper chromosomal location. The
template for PCR (DNA from the colony) is obtained by a rapid boiling DNA
preparation method as follows. An aliquot of the colony (stab of the colony with a
toothpick) is introduced into 100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5,
and boiled for 6 minutes. An equal volume of phenol:chloroform (1:1) is added and
vortexed. The mixture is microfuged for 5 minutes and the supernatant is used as DNA
template for PCR with combinations of the following primers to verify homologous
recombination at the proper chromosomal location.

TEST 1. PCR with cloning primers originally used to amplify the gene/ORF. A
positive result of homologous recombination at the correct chromosomal location should
show a single PCR product whose size is expected to be the size of the deleted
gene/ORF but increased in size by the addition of a 1.4 kilobase Kanamycin cassette. A
PCR product of just the size of the gene/ORF is proof that the gene has not been
knocked out and that the transformant is not the result of homologous recombination at
the correct chromosomal location.

TEST 2. PCR with F3 (primer designed from sequences upstream of the
gene/ORF and not present on the plasmid), and either primer Kan-1 or Kan-2 (primers
designed from the ends of the kanamycin resistance gene), depending on whether the
plasmid DNA used was of "A" or "B" orientation. Homologous recombination at the
correct chromosomal location will result in a single PCR product of the expected size
(i.e., from the location of F3 to the insertion site of kanamycin resistance gene). No
PCR product or PCR product(s) of incorrect size(s) will prove that the plasmid has not
integrated at the correct site and that the gene has not been knocked out.

TEST 3. PCR with R3 (primer designed from sequences downstream of the
gene/ORF and not present on the plasmid) and either primer Kan-1 or Kan-2, depending
on whether the plasmid DNA used was of "A" or "B" orientation. Homologous
recombination at the correct chromosomal location will result in a single PCR product of
the expected size (i.e., from the insertion site of kanamycin resistance gene to the
downstream location of R3). Again, no PCR product or PCR product(s) of incorrect
size(s) will prove that the plasmid has not integrated at the correct site and that the gene
has not been knocked out.
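
The size comparison underlying TEST 1 can be written out as a short calculation. This is a hedged sketch only: the function names, the example lengths and the 50 base pair gel-reading tolerance are assumptions introduced for illustration; only the 1.4 kilobase cassette size is taken from the text.

```python
KAN_CASSETTE_BP = 1400  # 1.4 kilobase Kanamycin cassette, as stated in the text


def expected_knockout_bp(wild_type_bp: int, deleted_bp: int,
                         cassette_bp: int = KAN_CASSETTE_BP) -> int:
    """Expected TEST 1 product size: amplicon minus deleted portion plus cassette."""
    if not 0 <= deleted_bp <= wild_type_bp:
        raise ValueError("deleted portion must fit within the wild-type amplicon")
    return wild_type_bp - deleted_bp + cassette_bp


def call_test1(observed_bp: int, wild_type_bp: int, deleted_bp: int,
               tolerance_bp: int = 50) -> str:
    """Classify an observed band; tolerance_bp is an assumed gel-reading tolerance."""
    if abs(observed_bp - expected_knockout_bp(wild_type_bp, deleted_bp)) <= tolerance_bp:
        return "knock-out"
    if abs(observed_bp - wild_type_bp) <= tolerance_bp:
        return "wild type"
    return "unexpected"


# Hypothetical example: 1200 bp cloning amplicon with a 200 bp deletion.
expected = expected_knockout_bp(1200, 200)
print(expected, call_test1(2380, 1200, 200), call_test1(1210, 1200, 200))
```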

Transformants showing positive results for all three tests above indicate that the
gene is not essential for survival in vitro.

A negative result in any of the three above tests for each transformant indicates
that the gene has not been disrupted, and that the gene is essential for survival in vitro.

In the event that no colonies result from two independent transformations while
the positive control with the disrupted ribonuclease H plasmid DNA produces
transformants, the plasmid DNA is further analyzed by PCR on DNA from transformant
populations prior to plating for colony formation. This will verify that the plasmid can
enter the cells and undergo homologous recombination at the correct site. Briefly,
plasmid DNA is incubated according to the transformation protocol described above.
DNA is extracted from the H. pylori cells immediately after incubation with the plasmid
DNAs and the DNA is used as template for the above TEST 2 and TEST 3. Positive
results in TEST 2 and TEST 3 would verify that the plasmid DNA could enter the cells
and undergo homologous recombination at the correct chromosomal location. If TEST
2 and TEST 3 are positive, then failure to obtain viable transformants indicates that the
gene is essential, and cells suffering a disruption in that gene are incapable of colony
formation.
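
The interpretation rules above can be summarized as a small decision procedure. The sketch below is one possible reading, not a definitive implementation: the function name, the argument layout and the "inconclusive" outcome are assumptions, as is the choice to treat a gene as non-essential when at least one transformant passes all three tests.

```python
from typing import List, Optional, Tuple


def classify_gene(colony_results: List[Tuple[bool, bool, bool]],
                  control_has_colonies: bool = True,
                  population_test2: Optional[bool] = None,
                  population_test3: Optional[bool] = None) -> str:
    """Interpret knock-out results.

    colony_results holds one (TEST 1, TEST 2, TEST 3) tuple per transformant
    colony and is empty when no colonies were obtained.
    """
    if colony_results:
        # A viable transformant with a confirmed disruption shows the gene is dispensable.
        if any(all(tests) for tests in colony_results):
            return "not essential in vitro"
        # Every transformant failed at least one test: the gene was not disrupted.
        return "essential in vitro (gene not disrupted)"
    # No colonies: rely on the positive control and the pre-plating population PCR.
    if control_has_colonies and population_test2 and population_test3:
        return "essential in vitro (disruption prevents colony formation)"
    return "inconclusive"  # assumed outcome; the text does not name this case


print(classify_gene([(True, True, True)]))
print(classify_gene([], control_has_colonies=True,
                    population_test2=True, population_test3=True))
```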

VII. High-throughput drug screen assay 
Cloning, expression and protein purification

Cloning, transformation, expression and purification of the H. pylori target gene
and its protein product, e.g., an H. pylori enzyme, to be used in a high-throughput drug
screen assay, is carried out essentially as described in Examples II and III above.

Development and application of a screening assay for a particular H. pylori gene
product, peptidyl-prolyl cis-trans isomerase, is described below as a specific example.

Enzymatic Assay 

The assay is essentially as described by Fischer (Fischer, G., et al. (1984) Biomed.
Biochim. Acta 43:1101-1111). The assay measures the cis-trans isomerization of the
Ala-Pro bond in the test peptide N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide (Sigma #
S-7388, lot # 84H5805). The assay is coupled with α-chymotrypsin, where the ability of
the protease to cleave the test peptide occurs only when the Ala-Pro bond is in trans.
The conversion of the test peptide to the trans isomer in the assay is followed at 390 nm
on a Beckman Model DU-650 spectrophotometer. The data are collected every second
with an average scanning time of 0.5 second. Assays are carried out in 35 mM
Hepes, pH 8.0, in a final volume of 400 µl, with 10 µM α-chymotrypsin (type I-S from
bovine pancreas, Sigma # C-7762, lot 23H7020) and 10 nM PPIase. To initiate the
reaction, 10 µl of the substrate (2 mM N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide in
DMSO) is added to 390 µl of reaction mixture at room temperature.
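
The final substrate concentration implied by these volumes follows from a standard dilution calculation. The sketch below shows the arithmetic only; the function name is hypothetical and the values are those stated above.

```python
def final_concentration(stock_conc: float, stock_volume: float,
                        final_volume: float) -> float:
    """Dilution: C_final = C_stock * V_stock / V_final (units carried by the caller)."""
    if final_volume <= 0:
        raise ValueError("final volume must be positive")
    return stock_conc * stock_volume / final_volume


# 10 microliters of 2 mM (2000 micromolar) substrate added to 390 microliters.
substrate_um = final_concentration(2000.0, 10.0, 400.0)   # micromolar
dmso_percent = final_concentration(100.0, 10.0, 400.0)    # % v/v DMSO
print(f"substrate: {substrate_um} micromolar, DMSO: {dmso_percent}% v/v")
```

Under these volumes the substrate is present at 50 micromolar and DMSO at 2.5% v/v in the 400 µl assay.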

Enzymatic assay in crude bacterial extract. 

A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth is harvested
at mid-log phase (OD 600 nm ~ 1) and resuspended in lysis buffer with the following
protease inhibitors: 1 mM PMSF, and 10 µg/ml of each of aprotinin, leupeptin,
pepstatin, TLCK, TPCK, and soybean trypsin inhibitor. The suspension is subjected to
3 cycles of freeze-thaw (15 minutes at -70°C, then 30 minutes at room temperature),
followed by sonication (three 20 second bursts). The lysate is centrifuged (12,000 x g
for 30 minutes) and the supernatant is assayed for enzymatic activity as described above.

Many H. pylori enzymes can be expressed at high levels and in an active form in
E. coli. Such high yields of purified proteins provide for the design of various high
throughput drug screening assays.

VIII. Truncated gene expression and protein production

Identification, cloning and expression of recombinant Helicobacter pylori sequences.

To facilitate the cloning, expression and purification of membrane proteins from
H. pylori, the pET gene expression system (Novagen), for cloning and expression of
recombinant proteins in Escherichia coli, was selected. Further, for proteins that have a
signal sequence at their amino-terminal end, a DNA sequence encoding a peptide tag
(His-tag) was fused to the 5' end of the H. pylori DNA sequences of interest in order to
facilitate purification of the recombinant protein products. In some cases, the DNA
sequence was cloned in frame with the glutathione-S-transferase protein to produce a
GST-fusion protein. The vectors used in this case were the pGEX series from
Pharmacia LKB (Uppsala, Sweden).

PCR amplification and cloning of DNA sequences containing ORFs for membrane and 
secreted proteins from the J99 strain of Helicobacter pylori. 

The sequences chosen (from the list of the DNA sequences of the invention) for
cloning from H. pylori strain J99 were prepared for amplification cloning by the
polymerase chain reaction (PCR). Synthetic oligonucleotide primers for the ORF of
interest (Table 11), specific for the predicted mature 5' end of the ORF and either
downstream (3') of the predicted translational termination codon or at specific points
within the coding region, were designed and purchased (GibcoBRL Life Technologies,
Gaithersburg, MD, USA). All forward primers (specific for the 5' terminus of the
region of ORF of interest) were designed to include either a BamHI or an NdeI restriction
site. The primers with the NdeI restriction site sequence were designed to permit the
initiation of protein translation at a methionine residue (encoded within the NdeI
restriction site sequence, in the case of producing a non His-tagged recombinant protein)
or to fuse in frame with the DNA sequence encoding the His-tag (for producing
His-tagged recombinant protein), followed by the coding sequence for the remainder of the
native H. pylori DNA. The primer with the BamHI restriction site was produced to fuse
the H. pylori specific sequence in-frame with the C-terminus of the
glutathione-S-transferase gene in the pGEX vectors (Pharmacia LKB, Uppsala, Sweden). All reverse
oligonucleotide primers were designed to include an EcoRI restriction site at the 5' terminus.
Several reverse oligonucleotide primers were selected that would cause a truncation of
the polypeptide to remove certain portions of the C-terminus, and in these cases the
EcoRI restriction site at the 5' end was followed by a translational termination codon.
This combination of primers would enable the ORF of interest (or parts of the ORF of
interest) to be cloned into pET28b (to produce a His-tagged recombinant protein),
pET30a (to produce a non His-tagged or native recombinant protein) or the pGEX-4T or
pGEX-5X series (to produce a GST fusion protein). The pET28b vector provides
sequence encoding an additional 20 amino-terminal amino acids (plus the methionine in
the NdeI restriction site) including a stretch of six histidine residues which makes up the
His-tag, whereas the pGEX vectors fuse the H. pylori protein to a 26,000 Da
glutathione-S-transferase protein.

Genomic DNA prepared from H. pylori strain J99 (ATCC 55679) was used as
the source of template DNA for the PCR amplification reactions (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To
amplify a DNA sequence containing a specific H. pylori ORF, genomic DNA (50
nanograms) was introduced into a reaction tube containing 200 nanograms of both the
forward and reverse synthetic oligonucleotide primer specific for the ORF of interest,
and 45 microliters of PCR SuperMix purchased (GibcoBRL Life Technologies,
Gaithersburg, MD, USA) in a total of 50 microliters. The PCR SuperMix is supplied in
1.1X concentrations and contains 22 mM Tris-HCl (pH 8.4), 55 mM KCl, 1.65 mM
MgCl2, 220 micromolar of each dATP, dCTP, dGTP and dTTP, 22 units recombinant
Taq polymerase/ml and stabilizers. The following thermal cycling conditions were used
to obtain amplified DNA products for each ORF using a Perkin Elmer Cetus/GeneAmp
PCR System thermal cycler.
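
Because 45 microliters of the 1.1X mix are brought to a total of 50 microliters, the working concentrations are nine-tenths of the supplied values. The sketch below simply carries out that dilution; the dictionary keys and function name are illustrative assumptions, while the supplied concentrations are those listed above.

```python
# Component concentrations of the mix as supplied (1.1X), taken from the text.
SUPERMIX_1_1X = {
    "Tris-HCl (mM)": 22.0,
    "KCl (mM)": 55.0,
    "MgCl2 (mM)": 1.65,
    "each dNTP (micromolar)": 220.0,
    "Taq polymerase (units/ml)": 22.0,
}


def final_concentrations(mix_ul: float = 45.0, total_ul: float = 50.0) -> dict:
    """Concentration of each component after dilution into the full reaction."""
    factor = mix_ul / total_ul
    return {name: round(value * factor, 3) for name, value in SUPERMIX_1_1X.items()}


reaction = final_concentrations()
taq_units = SUPERMIX_1_1X["Taq polymerase (units/ml)"] * 45.0 / 1000.0  # units per reaction
for name, value in reaction.items():
    print(f"{name}: {value}")
print(f"Taq polymerase per 50 microliter reaction: {taq_units:.2f} units")
```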
Table 11: Oligonucleotide primers

Gene and location                          Sequence

Vac38- BamHI post signal sequence          CGGGATCCGAAGGTGATGGTGTTTATATAGG
                                           (SEQ ID NO: 271)

Vac38- NdeI post signal sequence           CGCATATGGAAGGTGATGGTGTTTATATAGGG
                                           (SEQ ID NO: 272)

Vac38- EcoRI/stop codon (removes           GCGAATTCTCACTCTTTCCAATAGTTTGCTGCAGAGC
C-terminal third of protein)               (SEQ ID NO: 273)

Vac38- EcoRI/stop codon (removes           CCGGAATTCTTAATCCCGTTTCAAATGGTAATAAAGG
C-terminal 11 amino acids)                 (SEQ ID NO: 274)

Vac38- EcoRI downstream of native stop     GCGAATTCCCTT T 1 A fl TA A A A AGTG T AGTTATACC
codon                                      (SEQ ID NO: 275)
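
The design features described for these primers (a BamHI or NdeI site in the forward primers, and an EcoRI site followed by a stop codon in the truncating reverse primers) can be checked by simple string search. The sketch below is illustrative only; the dictionary labels and function names are assumptions, and SEQ ID NO: 275 is omitted because its sequence is not fully legible here.

```python
# Recognition sequences of the restriction enzymes named in the text.
SITES = {"BamHI": "GGATCC", "NdeI": "CATATG", "EcoRI": "GAATTC"}

# Reverse complements of the stop codons TAA, TAG and TGA, as they appear
# when read 5' to 3' on a reverse primer.
STOP_CODON_REVERSE_COMPLEMENTS = {"TTA", "CTA", "TCA"}

PRIMERS = {
    "SEQ ID NO: 271": "CGGGATCCGAAGGTGATGGTGTTTATATAGG",
    "SEQ ID NO: 272": "CGCATATGGAAGGTGATGGTGTTTATATAGGG",
    "SEQ ID NO: 273": "GCGAATTCTCACTCTTTCCAATAGTTTGCTGCAGAGC",
    "SEQ ID NO: 274": "CCGGAATTCTTAATCCCGTTTCAAATGGTAATAAAGG",
}


def sites_in(primer: str) -> list:
    """Names of the restriction sites found anywhere in the primer."""
    return [name for name, site in SITES.items() if site in primer]


def stop_follows_ecori(primer: str) -> bool:
    """True if the three bases right after the EcoRI site encode a stop codon
    on the complementary (coding) strand."""
    start = primer.find(SITES["EcoRI"])
    if start < 0:
        return False
    return primer[start + 6:start + 9] in STOP_CODON_REVERSE_COMPLEMENTS


for label, seq in PRIMERS.items():
    print(label, sites_in(seq), stop_follows_ecori(seq))
```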



Sequences for Vac38 (full length or truncated)

Denaturation at 94°C for 30 sec
35 cycles at 94°C for 15 sec, 55°C for 15 sec, and 72°C for 1.5 min
Reactions were concluded at 72°C for 8 minutes
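
The programmed hold time of this cycling profile can be totalled as follows. This is a minimal sketch; variable names are illustrative, and temperature ramping between steps (which depends on the instrument) is deliberately ignored.

```python
# Hold times in seconds, taken from the cycling conditions above.
INITIAL_DENATURATION_S = 30
CYCLES = 35
CYCLE_STEPS_S = (15, 15, 90)   # 94°C, 55°C and 72°C (1.5 min) holds
FINAL_EXTENSION_S = 8 * 60


def total_hold_seconds() -> int:
    """Sum of programmed hold times, excluding ramp times between temperatures."""
    return INITIAL_DENATURATION_S + CYCLES * sum(CYCLE_STEPS_S) + FINAL_EXTENSION_S


total_s = total_hold_seconds()
print(f"{total_s} s ({total_s / 60:.1f} min) of programmed hold time")
```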

Upon completion of the thermal cycling reactions, each sample of amplified
DNA was subjected to electrophoresis on 1.0% agarose gels. The DNA was visualized
by exposure to ethidium bromide and long wave UV irradiation, and cut out in gel
slices. DNA was purified using the Wizard PCR Preps kit (Promega Corp., Madison
WI, USA), and then subjected to digestion with BamHI and EcoRI (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). The
digested PCR amplicon was then re-electrophoresed and purified as before.

Ligation of H. pylori DNA sequences into cloning vectors

The pOK12 vector (J. Vieira and J. Messing, Gene 100: 189-194, 1991) was
prepared for cloning by digestion with BamHI and EcoRI or NdeI and EcoRI in the case
of Vac41 (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., eds., 1994). The vectors were subjected to electrophoresis on 1.0%
agarose gels and purified using the Wizard PCR Preps kit (Promega Corp., Madison WI,
USA). Following ligation of the purified, digested vector and the purified, digested
amplified H. pylori ORF, the products of the ligation reaction were transformed into E.
coli JM109 competent cells according to standard methods (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). Individual
bacterial colonies were screened for those containing the correct recombinant plasmids
by incubating in LB broth overnight (plus 25 µg/ml kanamycin sulfate) followed by
plasmid DNA preparation using the Magic Minipreps system (Promega Corp., Madison
WI, USA), and then analyzed by restriction digestion (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

Cloning of H. pylori DNA sequences into the pET28b, pET30a and pGEX4T-3
prokaryotic expression vectors

Both the pET28b and pET30a expression vectors were prepared for cloning by
digestion with NdeI and EcoRI, and the pGEX4T-3 vector was prepared for cloning by
digestion with BamHI and EcoRI (Current Protocols in Molecular Biology, John Wiley
and Sons, Inc., F. Ausubel et al., eds., 1994). The H. pylori DNA sequences were
removed from pOK12 plasmid backbones by digestion with NdeI and EcoRI or BamHI
and EcoRI (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., eds., 1994). The pET28b, pET30a, pGEX4T-3 and H. pylori DNA
sequences were all electrophoresed on a 1% agarose gel and purified using the Wizard
PCR Preps kit (Promega Corp., Madison WI, USA). Following ligation of the purified,
digested expression vector and the purified, digested H. pylori DNA sequences, the
products of the ligation reaction were transformed into E. coli JM109 competent cells
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al.,
eds., 1994). Individual bacterial colonies were screened for those containing the correct
recombinant plasmids by preparing plasmid DNA as described above followed by
analysis by restriction digestion profiles and DNA sequencing (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). These
recombinant plasmids were then used to transform specific E. coli expression strains.

Transformation of competent bacteria with recombinant expression plasmids

Competent bacterial strains BL21(DE3), BL21(DE3)pLysS, HMS174(DE3) and
HMS174(DE3)pLysS were prepared and transformed with the recombinant pET28b
expression plasmids carrying the cloned H. pylori sequences according to standard
methods (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., eds., 1994). These expression host strains contain a chromosomal copy
of the gene for T7 RNA polymerase. These hosts are lysogens of bacteriophage DE3, a
lambda derivative that carries the lacI gene, the lacUV5 promoter and the gene for T7
RNA polymerase. T7 RNA polymerase expression is induced by the addition of
isopropyl-β-D-thiogalactoside (IPTG), and the T7 RNA polymerase then transcribes any
target plasmid, such as pET28b, that carries a T7 promoter sequence and a gene of
interest.

Competent bacterial strains JM109 and DH5α were prepared and transformed
with the recombinant pGEX4T-3 expression plasmid carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

1 0 Expression of recombinant K pylori sequences in E. coli 

Transformants were collected from LB agar plates containing 25 µg/ml 
kanamycin sulfate (ensures maintenance of the pET28b-based recombinant plasmids) or 
100 µg/ml ampicillin (ensures maintenance of the pGEX4T-3-based recombinant 
plasmids) and used to inoculate LB broth containing 25 µg/ml kanamycin sulfate or 
100 µg/ml ampicillin and grown to an optical density at 600 nm of 0.5 to 1.0 OD units, at 
which point 1 mM IPTG was added to the culture for one to three hours to induce gene 
expression of the H. pylori recombinant DNA constructions. After induction of gene 
expression with IPTG, bacteria were pelleted by centrifugation and resuspended in 
SDS-PAGE solubilization buffer and subjected to SDS-PAGE (Current Protocols in 
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). Proteins 
were visualized by staining with Coomassie Brilliant Blue or detected by western 
immunoblotting using the specific anti-His tag monoclonal antibody (Clontech, Palo 
Alto, CA, USA) or the anti-GST tag antibody (Pharmacia LKB) using standard methods 
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., 
eds., 1994). The host strain that provided the highest level of recombinant protein 
production was then chosen for use in a large-scale induction in order to purify the 
recombinant protein. The strains used were HMS174(DE3) (pET28b-based constructs) 
and DH5α (pGEX4T-3-based constructs). 
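
The working concentrations given above (25 µg/ml kanamycin sulfate, 100 µg/ml ampicillin, 1 mM IPTG) are typically reached by diluting concentrated stock solutions into the culture. The following Python sketch illustrates that dilution arithmetic only. The stock concentrations and the culture volume are illustrative assumptions and are not stated in this specification.

```python
def stock_volume(culture_volume_ml: float, final_conc: float, stock_conc: float) -> float:
    """Return the stock volume (ml) to add so the mixture reaches final_conc.

    Solves stock_conc * v = final_conc * (culture_volume_ml + v) for v, so the
    added volume itself is accounted for. Both concentrations must be given
    in the same unit (for example mM, or ug/ml).
    """
    if stock_conc <= final_conc:
        raise ValueError("stock must be more concentrated than the target")
    return final_conc * culture_volume_ml / (stock_conc - final_conc)


# Hypothetical stocks (assumptions, not taken from the text):
# 1 M IPTG, 50 mg/ml kanamycin sulfate, 100 mg/ml ampicillin.
CULTURE_ML = 500.0  # assumed culture volume

iptg_ml = stock_volume(CULTURE_ML, final_conc=1.0, stock_conc=1000.0)      # mM
kan_ml = stock_volume(CULTURE_ML, final_conc=25.0, stock_conc=50_000.0)    # ug/ml
amp_ml = stock_volume(CULTURE_ML, final_conc=100.0, stock_conc=100_000.0)  # ug/ml

print(f"IPTG: {iptg_ml:.3f} ml, kanamycin: {kan_ml:.3f} ml, ampicillin: {amp_ml:.3f} ml")
```

Because the stocks in this sketch are 1000-fold or more concentrated than the targets, the correction for the added volume is small; it is included so the function stays exact for less concentrated stocks.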

Removal of the C-terminal regions appeared in both systems to improve the level 
of expression, although this increase was far more prominent in the GST-fusion system. 
All recombinant proteins produced were of the predicted molecular weight based on the 
DNA sequence plus, if necessary, the size of the fusion tag. The truncated portion of the 
H. pylori protein contains some extremely hydrophobic stretches, and removal of these 
may be the reason for the increased expression. 
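
The comparison of observed band size with the molecular weight predicted from the sequence plus the fusion tag can be sketched as follows. This is a minimal Python illustration, assuming standard average residue masses; the tag and protein sequences shown are hypothetical placeholders and are not taken from the sequence listing.

```python
# Average residue masses in daltons for the 20 common amino acids.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water per chain, for the free N- and C-termini


def predicted_mass_da(sequence: str) -> float:
    """Approximate average molecular mass (Da) of a polypeptide chain."""
    seq = sequence.upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    if not seq:
        return 0.0
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def fusion_mass_kda(tag: str, protein: str) -> float:
    """Mass (kDa) of a tag fused in frame to the N-terminus of a protein."""
    return predicted_mass_da(tag + protein) / 1000.0


# Hypothetical inputs, for illustration only.
his_tag = "MGSSHHHHHHSSGLVPRGSH"      # assumed His-tag leader
fragment = "MKKTLLAVSLLGSAYA" * 10    # placeholder, not a real H. pylori sequence

print(f"untagged: {predicted_mass_da(fragment) / 1000.0:.2f} kDa")
print(f"tagged:   {fusion_mass_kda(his_tag, fragment):.2f} kDa")
```

A larger tag such as GST adds correspondingly more mass, so the same function applies with the tag sequence swapped.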



WO 98/24475 



PCT/US97/22104 




EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments and methods 
described herein. Such equivalents are intended to be encompassed by the scope of the 
following claims. 

CLAIMS 

1. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori polypeptide at least about 60% homologous to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 98-SEQ ID NO: 194. 

2. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori polypeptide selected from the group consisting of SEQ ID NO: 98-SEQ ID 
NO: 194. 

3. An isolated nucleic acid which encodes an H. pylori polypeptide, 
comprising a nucleotide sequence at least about 60% homologous to a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a 
complement thereof. 

4. The isolated nucleic acid of claim 1, comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a complement 
thereof. 

5. An isolated nucleic acid molecule encoding an H. pylori polypeptide, 
comprising a nucleotide sequence which hybridizes under stringent hybridization 
conditions to a nucleic acid molecule comprising the nucleotide sequence selected from 
the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a complement thereof. 

6. An isolated nucleic acid comprising a nucleotide sequence of at least 8 
nucleotides in length, wherein the sequence hybridizes under stringent hybridization 
conditions to a nucleic acid having a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a complement thereof. 

7. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cell envelope polypeptide or a fragment thereof, said nucleic acid selected 
from the group consisting of SEQ ID NO: 63, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID 
NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID 
NO: 27, SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 61, SEQ ID 
NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, SEQ ID 
NO: 94, SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID 
NO: 42, SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID 
NO: 65, SEQ ID NO: 66, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID 
NO: 18, SEQ ID NO: 19, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 38, SEQ ID 
NO: 39, SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 34, SEQ ID NO: 
35, SEQ ID NO: 60, SEQ ID NO: 69, and SEQ ID NO: 83, or a complement thereof. 

8. The isolated nucleic acid of claim 7, wherein said H. pylori cell envelope 
polypeptide or a fragment thereof is an H. pylori flagella-associated polypeptide or a 
fragment thereof comprising a nucleotide sequence of SEQ ID NO: 63, or a complement 
thereof. 

9. The isolated nucleic acid of claim 7, wherein said H. pylori cell envelope 
polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a 
fragment thereof encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ 
ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 38, and SEQ ID NO: 39, or a complement 
thereof. 

10. The isolated nucleic acid of claim 9, wherein said H. pylori inner 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment 
thereof involved in transport encoded by a nucleic acid selected from the group 
consisting of SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ 
ID NO: 19, SEQ ID NO: 43, and SEQ ID NO: 44, or a complement thereof. 

11. The isolated nucleic acid of claim 7, wherein said H. pylori cell envelope 
polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a 
fragment thereof encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 23, SEQ ID NO: 24, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID 
NO: 51, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ ID 
NO: 85, SEQ ID NO: 91, SEQ ID NO: 94, SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID 
NO: 26, SEQ ID NO: 36, SEQ ID NO: 42, SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID 
NO: 29, SEQ ID NO: 30, SEQ ID NO: 65, and SEQ ID NO: 66, or a complement 
thereof. 

12. The isolated nucleic acid of claim 11, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof encoded by a nucleic acid selected 
from the group consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO: 11, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID 
NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 42, SEQ ID NO: 50, SEQ ID 
NO: 51, SEQ ID NO: 52, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID 
NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, and SEQ ID NO: 94, or a complement 
thereof. 

13. The isolated nucleic acid of claim 12, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
encoded by a nucleic acid selected from the group consisting of SEQ ID NO: 11, SEQ 
ID NO: 26, SEQ ID NO: 36, SEQ ID NO: 42, and SEQ ID NO: 52, or a complement 
thereof. 

14. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cell envelope polypeptide or a fragment thereof, wherein said polypeptide is 
selected from the group consisting of SEQ ID NO: 160, SEQ ID NO: 104, SEQ ID NO: 
105, SEQ ID NO: 106, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID 
NO: 121, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 148, SEQ 
ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 182, 
SEQ ID NO: 188, SEQ ID NO: 191, SEQ ID NO: 102, SEQ ID NO: 108, SEQ ID NO: 
123, SEQ ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 149, SEQ ID NO: 119, SEQ ID 
NO: 126, SEQ ID NO: 127, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 145, SEQ 
ID NO: 146, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, 
SEQ ID NO: 141, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 98, SEQ ID NO: 
99, SEQ ID NO: 103, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 157, SEQ ID 
NO: 166, and SEQ ID NO: 180. 

15. The isolated nucleic acid of claim 14, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori flagella-associated 
polypeptide or a fragment thereof comprising an amino acid sequence of SEQ ID NO: 
160. 

16. The isolated nucleic acid of claim 14, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 145, SEQ ID 
NO: 146, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ 
ID NO: 141, SEQ ID NO: 135, and SEQ ID NO: 136. 

17. The isolated nucleic acid of claim 16, wherein said H. pylori inner 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment 
thereof involved in transport selected from the group consisting of SEQ ID NO: 145, 
SEQ ID NO: 146, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 
140, and SEQ ID NO: 141. 

18. The isolated nucleic acid of claim 14, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 104, SEQ ID 
NO: 105, SEQ ID NO: 106, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ 
ID NO: 121, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 148, 
SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 
182, SEQ ID NO: 188, SEQ ID NO: 191, SEQ ID NO: 102, SEQ ID NO: 108, SEQ ID 
NO: 123, SEQ ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 149, SEQ ID NO: 119, SEQ 
ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 162, and SEQ ID NO: 163. 

19. The isolated nucleic acid of claim 18, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof selected from the group consisting 
of SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 108, SEQ ID 
NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 123, SEQ 
ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 147, 
SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 
177, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188, and SEQ ID NO: 191. 

20. The isolated nucleic acid of claim 19, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
selected from the group consisting of SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 
133, SEQ ID NO: 139, and SEQ ID NO: 149. 

21. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cytoplasmic polypeptide or a fragment thereof, said nucleic acid selected from 
the group consisting of SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 86, SEQ ID NO: 
87, SEQ ID NO: 88, SEQ ID NO: 89, SEQ ID NO: 92, and SEQ ID NO: 93, or a 
complement thereof. 

22. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in mRNA translation, said nucleic acid selected from the group consisting of 
SEQ ID NO: 57 and SEQ ID NO: 58, or a complement thereof. 



23. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in genome replication, transcription, recombination and repair, said nucleic 
acid selected from the group consisting of SEQ ID NO: 86 and SEQ ID NO: 87, or a 
complement thereof. 

24. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cytoplasmic polypeptide or a fragment thereof, wherein said polypeptide is 
selected from the group consisting of SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 
183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 189, and SEQ 
ID NO: 190. 



25. The isolated nucleic acid of claim 24, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in mRNA translation selected from the group consisting of SEQ ID NO: 154 
and SEQ ID NO: 155. 

26. The isolated nucleic acid of claim 24, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in genome replication, transcription, recombination and repair selected from 
the group consisting of SEQ ID NO: 183 and SEQ ID NO: 184. 

27. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori secreted polypeptide or a fragment thereof, said nucleic acid selected from the 
group consisting of SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO: 20, SEQ ID NO: 25, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 45, 
SEQ ID NO: 46, SEQ ID NO: 53, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 70, 
SEQ ID NO: 77, SEQ ID NO: 78, SEQ ID NO: 81, SEQ ID NO: 82, SEQ ID NO: 90, 
SEQ ID NO: 95, and SEQ ID NO: 97, or a complement thereof. 

28. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori secreted polypeptide or a fragment thereof selected from the group consisting 
of SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 107, SEQ ID NO: 109, SEQ ID 
NO: 117, SEQ ID NO: 122, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 142, SEQ 
ID NO: 143, SEQ ID NO: 150, SEQ ID NO: 161, SEQ ID NO: 164, SEQ ID NO: 167, 
SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 
187, SEQ ID NO: 192, and SEQ ID NO: 194. 

29. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cellular polypeptide or a fragment thereof, said nucleic acid selected from the 
group consisting of SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 21, SEQ ID NO: 33, 
SEQ ID NO: 37, SEQ ID NO: 40, SEQ ID NO: 41, SEQ ID NO: 47, SEQ ID NO: 54, 
SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 59, SEQ ID NO: 62, SEQ ID NO: 68, 
SEQ ID NO: 71, SEQ ID NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, 
SEQ ID NO: 76, and SEQ ID NO: 96, or a complement thereof. 

30. An isolated nucleic acid comprising a nucleotide sequence encoding an 
H. pylori cellular polypeptide or a fragment thereof, wherein said polypeptide is selected 
from the group consisting of SEQ ID NO: 112, SEQ ID NO: 113, SEQ ID NO: 118, 
SEQ ID NO: 130, SEQ ID NO: 134, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 
144, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 156, SEQ ID 
NO: 159, SEQ ID NO: 165, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID NO: 170, SEQ 
ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, and SEQ ID NO: 193. 



31. A probe comprising a nucleotide sequence consisting of at least 8 
nucleotides of a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1-SEQ ID NO: 97, or a complement thereof. 

32. A recombinant expression vector comprising the nucleic acid of any of 
claims 1, 2, 3, 4, 5, 6, 7, 14, 21, 24, 27, 28, 29 or 30 operably linked to a transcription 
regulatory element. 

33. A cell comprising a recombinant expression vector of claim 32. 

34. A method for producing an H. pylori polypeptide comprising culturing a 
cell of claim 33 under conditions that permit expression of the polypeptide. 

35. The method of claim 34, further comprising purifying the polypeptide 
from the cell. 

36. A method for detecting the presence of a Helicobacter nucleic acid in a 
sample comprising: 

(a) contacting a sample with a nucleic acid of any of claims 6 or 31 
so that a hybrid can form between the probe and a Helicobacter nucleic acid in the 
sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a 
hybrid indicates the presence of a Helicobacter nucleic acid in the sample. 

37. An isolated H. pylori polypeptide comprising an amino acid sequence at 
least about 60% homologous to an H. pylori polypeptide selected from the group 
consisting of SEQ ID NO: 98-SEQ ID NO: 194. 

38. An isolated H. pylori polypeptide which is encoded by a nucleic acid 
comprising a nucleotide sequence at least about 60% homologous to a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97. 



39. The isolated H. pylori polypeptide of claim 28, wherein said polypeptide 
is encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 
1-SEQ ID NO: 97. 



40. An isolated H. pylori polypeptide which is encoded by a nucleic acid 
which hybridizes under stringent hybridization conditions to a nucleic acid selected from 
the group consisting of SEQ ID NO: 1-SEQ ID NO: 97, or a complement thereof. 

41. An isolated H. pylori polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 97-SEQ ID NO: 194. 

42. An isolated H. pylori cell envelope polypeptide or a fragment thereof, 
wherein said polypeptide is selected from the group consisting of SEQ ID NO: 160, 
SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 110, SEQ ID NO: 
111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID 
NO: 147, SEQ ID NO: 148, SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 177, SEQ 
ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188, SEQ ID NO: 191, SEQ ID NO: 102, 
SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 
149, SEQ ID NO: 119, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 162, SEQ ID 
NO: 163, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 114, SEQ ID NO: 115, SEQ 
ID NO: 116, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 135, SEQ ID NO: 136, 
SEQ ID NO: 98, SEQ ID NO: 99, SEQ ID NO: 103, SEQ ID NO: 131, SEQ ID NO: 
132, SEQ ID NO: 157, SEQ ID NO: 166, and SEQ ID NO: 180. 



43. The isolated polypeptide of claim 42, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori flagella-associated 
polypeptide or a fragment thereof comprising an amino acid sequence of SEQ ID 
NO: 160. 



44. The isolated polypeptide of claim 43, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 145, SEQ ID 
NO: 146, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 140, SEQ 
ID NO: 141, SEQ ID NO: 135, and SEQ ID NO: 136. 



45. The isolated polypeptide of claim 44, wherein said H. pylori inner 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment 
thereof involved in transport selected from the group consisting of SEQ ID NO: 145, 
SEQ ID NO: 146, SEQ ID NO: 114, SEQ ID NO: 115, SEQ ID NO: 116, SEQ ID NO: 
140, and SEQ ID NO: 141. 



46. The isolated polypeptide of claim 43, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof selected from the group consisting of SEQ ID NO: 104, SEQ ID 
NO: 105, SEQ ID NO: 106, SEQ ID NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ 
ID NO: 121, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 148, 
SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 181, SEQ ID NO: 
182, SEQ ID NO: 188, SEQ ID NO: 191, SEQ ID NO: 102, SEQ ID NO: 108, SEQ ID 
NO: 123, SEQ ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 149, SEQ ID NO: 119, SEQ 
ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 162, and SEQ ID NO: 163. 



47. The isolated polypeptide of claim 46, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof selected from the group consisting 
of SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 108, SEQ ID 
NO: 110, SEQ ID NO: 111, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 123, SEQ 
ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 133, SEQ ID NO: 139, SEQ ID NO: 147, 
SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 158, SEQ ID NO: 176, SEQ ID NO: 
177, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 188, and SEQ ID NO: 191. 

48. The isolated polypeptide of claim 47, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
selected from the group consisting of SEQ ID NO: 108, SEQ ID NO: 123, SEQ ID NO: 
133, SEQ ID NO: 139, and SEQ ID NO: 149. 

49. An isolated H. pylori cell envelope polypeptide or a fragment thereof, 
wherein said polypeptide is encoded by a nucleic acid selected from the group consisting 
of SEQ ID NO: 63, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 13, 
SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 27, SEQ ID NO: 28, 
SEQ ID NO: 50, SEQ ID NO: 51, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID NO: 80, 
SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, SEQ ID NO: 94, SEQ ID NO: 5, 
SEQ ID NO: 11, SEQ ID NO: 26, SEQ ID NO: 36, SEQ ID NO: 42, SEQ ID NO: 52, 
SEQ ID NO: 22, SEQ ID NO: 29, SEQ ID NO: 30, SEQ ID NO: 65, SEQ ID NO: 66, 
SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, 
SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 1, 
SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 34, SEQ ID NO: 35, SEQ ID NO: 60, SEQ 
ID NO: 69, and SEQ ID NO: 83. 

50. The isolated polypeptide of claim 49, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori flagella-associated 
polypeptide or a fragment thereof encoded by a nucleic acid comprising a nucleotide 
sequence of SEQ ID NO: 63. 

51. The isolated polypeptide of claim 49, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide 
or a fragment thereof encoded by a nucleic acid selected from the group consisting of 
SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, 
SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 38, and SEQ ID NO: 39. 

52. The isolated polypeptide of claim 51, wherein said H. pylori inner 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment 
thereof involved in transport encoded by a nucleic acid selected from the group 
consisting of SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO: 17, SEQ ID NO: 18, SEQ 
ID NO: 19, SEQ ID NO: 43, and SEQ ID NO: 44. 

53. The isolated polypeptide of claim 49, wherein said H. pylori cell 
envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide 
or a fragment thereof encoded by a nucleic acid selected from the group consisting of 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 13, SEQ ID NO: 14, SEQ 
ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 50, SEQ 
ID NO: 51, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID NO: 84, SEQ 
ID NO: 85, SEQ ID NO: 91, SEQ ID NO: 94, SEQ ID NO: 5, SEQ ID NO: 11, SEQ ID 
NO: 26, SEQ ID NO: 36, SEQ ID NO: 42, SEQ ID NO: 52, SEQ ID NO: 22, SEQ ID 
NO: 29, SEQ ID NO: 30, SEQ ID NO: 65, and SEQ ID NO: 66. 

54. The isolated polypeptide of claim 53, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue or a fragment thereof encoded by a nucleic acid selected 
from the group consisting of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO: 11, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID 
NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 36, SEQ ID NO: 42, SEQ ID 
NO: 50, SEQ ID NO: 51, SEQ ID NO: 52, SEQ ID NO: 61, SEQ ID NO: 79, SEQ ID 
NO: 80, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 91, and SEQ ID NO: 94. 

55. The isolated polypeptide of claim 54, wherein said H. pylori outer 
membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a 
terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof 
encoded by a nucleic acid selected from the group consisting of SEQ ID NO: 11, SEQ 
ID NO: 26, SEQ ID NO: 36, SEQ ID NO: 42, and SEQ ID NO: 52. 

56. An isolated H. pylori cytoplasmic polypeptide or a fragment thereof, 
wherein said polypeptide is selected from the group consisting of SEQ ID NO: 154, 
SEQ ID NO: 155, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 
186, SEQ ID NO: 189, and SEQ ID NO: 190. 

57. The isolated polypeptide of claim 56, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in mRNA translation selected from the group consisting of SEQ ID NO: 154 
and SEQ ID NO: 155. 

58. The isolated polypeptide of claim 56, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in genome replication, transcription, recombination and repair selected from 
the group consisting of SEQ ID NO: 183 and SEQ ID NO: 184. 

59. An isolated H. pylori cytoplasmic polypeptide or a fragment thereof, 
wherein said polypeptide is encoded by a nucleic acid selected from the group consisting 
of SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 86, SEQ ID NO: 87, SEQ ID NO: 88, 
SEQ ID NO: 89, SEQ ID NO: 92, and SEQ ID NO: 93. 

60. The isolated polypeptide of claim 59, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in mRNA translation, said polypeptide encoded by a nucleic acid selected from 
the group consisting of SEQ ID NO: 57 and SEQ ID NO: 58. 

61. The isolated polypeptide of claim 59, wherein said H. pylori cytoplasmic 
polypeptide or a fragment thereof is an H. pylori polypeptide or a fragment thereof 
involved in genome replication, transcription, recombination and repair, said 
polypeptide encoded by a nucleic acid selected from the group consisting of SEQ ID 
NO: 86 and SEQ ID NO: 87. 

62. An isolated H. pylori cellular polypeptide or a fragment thereof, wherein 
said polypeptide is selected from the group consisting of SEQ ID NO: 112, SEQ ID NO: 
113, SEQ ID NO: 118, SEQ ID NO: 130, SEQ ID NO: 134, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 144, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ 
ID NO: 156, SEQ ID NO: 159, SEQ ID NO: 165, SEQ ID NO: 168, SEQ ID NO: 169, 
SEQ ID NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, and SEQ ID 
NO: 193. 

63. An isolated H. pylori cellular polypeptide or a fragment thereof, wherein 
said polypeptide is encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 21, SEQ ID NO: 33, SEQ ID NO: 37, SEQ 
ID NO: 40, SEQ ID NO: 41, SEQ ID NO: 47, SEQ ID NO: 54, SEQ ID NO: 55, SEQ 
ID NO: 56, SEQ ID NO: 59, SEQ ID NO: 62, SEQ ID NO: 68, SEQ ID NO: 71, SEQ ID 
NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, and SEQ 
ID NO: 96. 

64. An isolated H. pylori secreted polypeptide or a fragment thereof, wherein 
said polypeptide is selected from the group consisting of SEQ ID NO: 100, SEQ ID NO: 
101, SEQ ID NO: 107, SEQ ID NO: 109, SEQ ID NO: 117, SEQ ID NO: 122, SEQ ID 
NO: 128, SEQ ID NO: 129, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 150, SEQ 
ID NO: 161, SEQ ID NO: 164, SEQ ID NO: 167, SEQ ID NO: 174, SEQ ID NO: 175, 
SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 187, SEQ ID NO: 192, and SEQ ID 
NO: 194. 

65. An isolated H. pylori secreted polypeptide or a fragment thereof, wherein 
said polypeptide is encoded by a nucleic acid selected from the group consisting of SEQ 
ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 20, SEQ ID 
NO: 25, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 45, SEQ ID NO: 46, SEQ ID 
NO: 53, SEQ ID NO: 64, SEQ ID NO: 67, SEQ ID NO: 70, SEQ ID NO: 77, SEQ ID 
NO: 78, SEQ ID NO: 81, SEQ ID NO: 82, SEQ ID NO: 90, SEQ ID NO: 95, and SEQ 
ID NO: 97. 

66. A fusion protein comprising an H. pylori polypeptide which comprises an 
amino acid sequence selected from the group consisting of SEQ ID NO: 98-SEQ ID NO: 
194 operatively linked to a non-H. pylori polypeptide. 

67. A vaccine formulation for prophylactic or therapeutic treatment of an H. 
pylori infection comprising an effective amount of at least one isolated nucleic acid of 
any of claims 1, 2, 3, 4, 5, 6, 7, 14, 21, 24, 27, 28, 29 or 30. 

68. A vaccine formulation for prophylactic or therapeutic treatment of an H. 
pylori infection comprising an effective amount of at least one H. pylori polypeptide or a 
fragment thereof of any of claims 37, 38, 40, 41, 42, 49, 56, 59, 62, 63, 64 or 65. 

69. A vaccine formulation of claim 67, further comprising a pharmaceutically 
acceptable carrier. 

70. A vaccine formulation of claim 68, further comprising a pharmaceutically 
acceptable carrier. 

71. A vaccine formulation of claim 69, wherein the pharmaceutically 
acceptable carrier comprises an adjuvant. 

72. A vaccine formulation of claim 70, wherein the pharmaceutically 
acceptable carrier comprises an adjuvant. 

73. A vaccine formulation of claim 69, wherein the pharmaceutically 
acceptable carrier comprises a delivery system. 

74. A vaccine formulation of claim 70, wherein the pharmaceutically 
acceptable carrier comprises a delivery system. 

75. A vaccine formulation of claim 73, wherein the delivery system 
comprises a live vector. 

76. A vaccine formulation of claim 74, wherein the delivery system 
comprises a live vector. 



77. A vaccine formulation of claim 75, wherein the live vector is a bacteria or 
a virus. 

78. A vaccine formulation of claim 76, wherein the live vector is a bacteria or 
a virus. 

79. A vaccine formulation of claim 73, wherein the pharmaceutically 
acceptable carrier further comprises an adjuvant. 

80. A vaccine formulation of claim 74, wherein the pharmaceutically 
acceptable carrier further comprises an adjuvant. 

81. A vaccine formulation for prophylactic or therapeutic treatment of an H. 
pylori infection comprising an effective amount of at least one isolated nucleic acid 
encoding an H. pylori outer membrane polypeptide or a fragment thereof, said nucleic 
acid selected from the group consisting of SEQ ID NO: 28, SEQ ID NO: 50, SEQ ID 
NO: 24, SEQ ID NO: 11, SEQ ID NO: 52, SEQ ID NO: 42 and SEQ ID NO: 79. 

82. The vaccine formulation of claim 81, wherein said nucleic acid comprises 
a nucleotide sequence of SEQ ID NO: 52. 

83. A vaccine formulation for prophylactic or therapeutic treatment of an H. 
pylori infection comprising an effective amount of at least one H. pylori outer membrane 
polypeptide or a fragment thereof, said polypeptide selected from the group consisting of 
SEQ ID NO: 125, SEQ ID NO: 147, SEQ ID NO: 121, SEQ ID NO: 108, SEQ ID NO: 
149, SEQ ID NO: 139 and SEQ ID NO: 176. 

84. The vaccine formulation of claim 81, wherein said polypeptide comprises 
an amino acid sequence of SEQ ID NO: 149. 

85. A vaccine formulation of claims 81 or 83, further comprising a 
pharmaceutically acceptable carrier. 

86. A vaccine formulation of claim 85, wherein the pharmaceutically 
acceptable carrier comprises an adjuvant. 

87. A vaccine formulation of claim 85, wherein the pharmaceutically 
acceptable carrier comprises a delivery system. 

88. A vaccine formulation of claim 87, wherein the delivery system 
comprises a live vector. 

89. A vaccine formulation of claim 88, wherein the live vector is a bacteria or 
a virus. 

90. A vaccine formulation of claim 86, wherein the pharmaceutically 
acceptable carrier further comprises an adjuvant. 

91. A method of treating or reducing a risk of H. pylori infection in a subject 
comprising administering to a subject a vaccine formulation of claim 67, such that 
treatment or reduction of risk of H. pylori infection occurs. 

92. A method of treating or reducing a risk of H. pylori infection in a subject
comprising administering to a subject a vaccine formulation of claim 68, such that
treatment or reduction of risk of H. pylori infection occurs.

93. A method of treating or reducing a risk of H. pylori infection in a subject
comprising administering to a subject a vaccine formulation of claim 81, such that
treatment or reduction of risk of H. pylori infection occurs.

94. A method of treating or reducing a risk of H. pylori infection in a subject 
comprising administering to a subject a vaccine formulation of claim 83, such that 
treatment or reduction of risk of H. pylori infection occurs. 

95. A method of producing a vaccine formulation comprising: combining at 
least one isolated H. pylori polypeptide or a fragment thereof selected from the group 
consisting of SEQ ID NO: 98-SEQ ID NO: 194 with a pharmaceutically acceptable
carrier to thereby form a vaccine formulation. 

96. A method of producing a vaccine formulation comprising: 

(a) providing at least one isolated H. pylori polypeptide or a fragment 
thereof selected from the group consisting of SEQ ID NO: 98-SEQ ID NO: 194; and 

(b) combining at least one said isolated H. pylori polypeptide or a
fragment thereof with a pharmaceutically acceptable carrier to thereby form a vaccine
formulation.

97. A method of producing a vaccine formulation comprising:

(a) culturing a cell under conditions that permit expression of an H.
pylori polypeptide or a fragment thereof selected from the group consisting of SEQ ID
NO: 98-SEQ ID NO: 194;

(b) isolating said H. pylori polypeptide or a fragment thereof from
said cell; and

(c) combining at least one said isolated H. pylori polypeptide or a
fragment thereof with a pharmaceutically acceptable carrier to thereby form a vaccine
formulation.

98. A chimeric H. pylori polypeptide comprising at least two H. pylori
polypeptides or fragments thereof, wherein said polypeptides are encoded by nucleic
acid sequences selected from the group consisting of SEQ ID NO: 1-SEQ ID NO: 97.

99. A chimeric H. pylori polypeptide comprising at least two H. pylori 
polypeptides or fragments thereof, wherein said polypeptides are selected from the 
group consisting of SEQ ID NO: 98-SEQ ID NO: 194.


